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Abstract  of  Dissertation  Presented  to  the  Graduate  School 
of  the  University  of  Florida  in  Partial  Fulfillment  of  the 
Requirements  for  the  Degree  of  Doctor  of  Philosophy 

ANALYSIS  OF  MULTIVARIABLE  CONTROL  SYSTEMS  IN  THE  PRESENCE 
OF  STRUCTURED  UNCERTAINTIES 

By 

ROBERT  JEFFREY  NORRIS 
August  1990 

Chairman:  Dr.  Haniph  A.  Latchman 
Major  Department:  Electrical  Engineering 

^  An  analysis  of  the  stability  properties  of  uncertain  multivariable  control  systems 
in  the  frequency  domain  is  presented.  Necessary  and  sufficient  stability  criteria  are 
reviewed  along  with  singular  value  scaling  techniques  for  characterizing  permissible 
uncertainties.  Such  scaling  methods  have  become  widely  accepted  tools  for  the  anal¬ 
ysis  of  control  systems  in  the  presence  of  structured  uncertainties.  Included  in  this 
study  are  the  general  block  similarity  scaling  techniques  advanced  by  Doyle  and  the 
nonsimilarity  scaling  approach  of  Kouvaritakis  and  Latchman. 


For  element-by-element  structured  uncertainties,  both  scaling  methods  reliably 
compute  Doyle’s  structured  singular  value,  fi,  which  provides  an  indication  of  sys¬ 
tem  stability.  However,  the  similarity  scaling  formulation  has  the  disadvantage  of 

^Cuo.rSfN'  _ _ 

rf  \  77 - .  . 

expanding  an  n  x  n  system  matrix  to  an  w  x  m  matrix  requiring  n*  —  1  optimization 

*h)  n 

variables  to  compute  Using  nonsimilarity  scaling,  the  system  size  remains  n  with 
the  additional  benefit  of  requiring  only  2 (n  —  1)  optimization  variables.  _____ — 7)  (3 
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The  results  of  this  work  show  that  for  scalar  uncertainties,  the  structure  may  be 
exploited  to  yield  a  similarity  scaling  method  which  requires  no  more  than  the  2(n  —  1) 
optimization  variables  needed  for  nonsimilarity  scaling.  Substantial  savings  in  floating 
point  operations  are  observed  for  various  system  sizes  enhancing  the  capability  of  this 
method  for  analysis  and  iterative  design.  A  similar  reduction  in  optimization  variables 
is  shown  to  hold  for  the  important  class  of  general  block  structured  uncertainties.  This 
reduction  leads  to  a  complete  solution  for  the  2x2  block  uncertainty  problem. 

In  addition,  a  direct  relationship  between  the  similarity  and  nonsimilarity  scaling 
matrices  is  presented.  This  direct  relationship,  coupled  with  a  reduction  of  opti¬ 
mization  variables  like  that  shown  for  similarity  scaling,  provides  a  more  efficient 
implementation  of  the  Fan  and  Tits  vector  optimization  method  for  computing  fi. 

For  systems  with  repeated  maximum  singular  values,  both  similarity  and  nonsim¬ 
ilarity  scaling  procedures  generally  fail  to  calculate  //  exactly.  By  invoking  the  major 
principal  direction  alignment  principle  of  Kouvaritakis  and  Latchman,  a  2 (q  —  T)- 
dimensional  optimization  problem  is  proposed  for  estimating  //  where  q  represents 
the  multiplicity  of  the  maximum  singular  value.  In  all  numerical  experience  to  date, 
these  estimates  and  the  corresponding  exact  values  of  //  have  agreed  within  three 
significant  figures.  Application  of  these  techniques  to  design  is  discussed. 


CHAPTER  1 

INTRODUCTION  TO  SYSTEM  UNCERTAINTY 
1.1  Introduction 

The  first  task  in  any  control  design  is  the  modelling  process  through  which  a  math¬ 
ematical  representation  of  the  system  is  developed.  This  model  may  be  constructed 
theoretically  through  some  knowledge  of  the  physical  laws  involved  or  empirically 
by  characterizing  experimental  data  collected  over  the  range  of  operating  conditions 
[1].  Theoretical  models  may  assume  linearity  around  some  nominal  operating  point 
while  neglecting  nonlinear  factors  that  may  dominate  away  from  this  nominal  point. 
Similarly,  the  range  of  operating  conditions  chosen  for  the  empirical  model  may  not 
be  sufficiently  exhaustive  to  adequately  characterize  system  behavior.  For  both  mod¬ 
elling  approaches,  the  effects  of  component  aging,  temperature  and  pressure  varia¬ 
tions,  manufacturing  tolerances  and  countless  other  unknown  factors  combine  to  cast 
doubt  on  the  accuracy  of  a  particular  system  model.  It  is  therefore  critical  to  ensure 
that  a  controller  designed  for  a  nominal  model  does  not  lose  required  stability  and 
performance  properties  when  applied  to  the  real-world  system  [2], 

For  Single-Input  Single-Output  (SISO)  systems,  model  uncertainty  problems  have 
typically  been  addressed  by  ensuring  that  adequate  gain  and  phase  margins  exist 
throughout  the  range  of  operating  conditions.  These  margins  could  then  absorb 
the  detrimental  effects  of  the  uncertainties  without  sacrificing  stability  requirements. 
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For  Multi-Input  Multi-Output  (MIMO)  systems,  the  classical  definitions  of  gain  and 
phase  margins  do  not  apply  because  of  complex  system  interactions  [3].  Examples  of 
multivariable  systems  include  chemical  plants,  nuclear  facilities  and  high  performance 
aircraft  where  unstable  operation  may  resuit  in  the  loss  of  expensive  equipment  or 
even  lives  [4,  5]. 

This  dissertation  reviews  some  of  the  current  frequency  domain  techniques  em¬ 
ployed  in  the  analysis  of  multivariable  systems  with  uncertain  plant  models.  Of  par¬ 
ticular  importance  are  those  approaches  that  address  structured  uncertainties  because 
they  generally  produce  a  much  less  conservative  stability  analysis  than  those  dealing 
strictly  with  unstructured  uncertainties.  Alternative  formulations  of  these  approaches 
are  developed  offering  substantial  improvements  in  computational  efficiency. 

Following  this  introduction  is  a  brief  discussion  of  the  historical  treatment  of  un¬ 
certainties  for  feedback  systems  as  well  as  a  list  of  standard  notation  used  throughout 
this  work.  Chapter  2  summarizes  the  classical  Nyquist  stability  criterion  for  SISO 
systems  as  well  as  the  generalized  nyquist  criterion  for  MIMO  systems.  Singular  value 
techniques  that  provide  sufficient  stability  criteria  are  reviewed  in  Chapter  3  while 
Chapter  4  covers  various  scaling  techniques  that  recover  the  necessity  of  the  stability 
criteria  under  most  conditions.  The  main  contributions  of  this  dissertation  appear 
in  Chapters  5,  6,  7,  and  8  along  with  several  examples  that  illustrate  the  advantages 
offered  by  these  new  results.  Finally,  Chapter  9  summarizes  the  work  and  introduces 
some  of  the  promising  directions  for  future  robust  control  research. 
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1.2  Historical  Treatment  of  Uncertainty 

One  of  the  first  efforts  to  account  for  model  uncertainties  during  control  system 
design  was  by  H.  S.  Black  of  Bell  Laboratories  in  1927  [6],  Black  introduced  the 
concept  of  feedback  to  eliminate  amplifier  distortion  in  long  distance  telephone  com¬ 
munications.  Amplifiers  of  that  time  required  hourly  adjustments  of  bias  currents 
resulting  in  unacceptable  manpower  demands  [7],  Black’s  feedback  amplifier  proved 
to  be  almost  completely  immune  to  amplifier  uncertainties  caused  by  nonlinearities 
and  temperature  and  aging  changes.  One  drawback  to  the  new  amplifier  design  was 
an  occasional  self-oscillation  or  “singing”  at  certain  loop  gain  settings.  In  response 
to  this  sometimes  severe  problem,  another  Bell  Laboratories  scientist,  II.  Nyquist, 
developed  the  now  famous  Nyquist  stability  criterion  relating  closed  loop  system  sta¬ 
bility  to  open  loop  frequency  response  information  [8].  From  this  stability  theory 
came  the  pioneering  work  of  H.  W.  Bode  concerning  the  issue  of  stability  robustness 
in  controller  design  [9].  The  resulting  concepts  of  gain  and  phase  margins  for  SISO 
systems  provided  the  design  engineer  with  an  measure  of  stability  in  the  presence  of 
uncertainties. 

The  Nyquist  stability  criterion  is  recognized  as  a  major  advance  in  the  design  of 
stable  SISO  control  systems  and  it  forms  the  basis  of  classical  control  theory.  How¬ 
ever,  events  of  the  1950s  and  1960s  shifted  the  emphasis  of  control  theory  research 
from  the  frequency  domain  to  the  time  domain  representation  as  worldwide  atten¬ 
tion  focused  on  manned  and  unmanned  rocket  guidance  and  control.  With  almost 
unlimited  budgets  and  well  defined  models,  the  problems  associated  with  system  un- 
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certainty  became  (at  least  temporarily)  less  important.  However,  the  late  1970s  saw 
renewed  interest  in  frequency  domain  analysis  and  design  as  the  precise  plant  models 
required  for  optimal  state-space  methods  simply  could  not  be  determined  for  many 
important  systems  due  to  numerous  plant  uncertainties  [10]. 

Finally,  in  1977  a  multivariable  analogue  to  the  Nyquist  stability  criterion  known 
as  the  characteristic  loci  method  (CLM)  vas  developed  [11].  Introduced  by  Mac- 
Farlane  and  Postlethwaite,  the  CLM  utilizes  frequency  dependent  plots  of  transfer 
matrix  eigenvalues  (called  loci)  to  characterize  MIMO  system  stability.  In  its  original 
formulation,  the  CLM  does  not  provide  easily  discernable  information  on  stability 
margins  because  slight  perturb  :ons  in  the  transfer  matrix  can  cause  large  shifts  in 
the  loci.  This  difficulty  led  to  the  use  of  singular  value  techniques  as  a  means  of 
extending  the  CLM  to  account  for  system  uncertainty. 

For  unstructured  uncertainties,  wh.»\:  presume  no  knowledge  of  an  uncertainty 
other  than  a  norm  bound,  Doyle  and  Stein  have  shown  that  singular  value  bounds 
can  provide  necessary  and  sufficient  stability  conditions  for  uncertain  multivariable 
systems  [12].  Unfortunately,  this  formulation  fails  to  take  advantage  of  any  available 
knowledge  concerning  the  possibly  well-defined  structure  of  the  uncertainty.  The 
controller  must  then  accommodate  uncertainties  that  may  be  physically  impossible 
leading  to  an  overly  conservative  design  [13]. 

Safonov  applied  the  concept  of  similarity  scaling  to  singular  value  analysis  for 
the  case  of  diagonally  structured  scalar  uncertainties  [14].  This  scaling  idea  was 
then  extended  by  Doyle  to  consider  the  important  class  of  general  block  structured 
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perturbations  which  allow  norm  bounded  uncertainty  blocks  to  be  of  arbitrary  dimen¬ 
sion  [15].  Doyle’s  concept  of  the  “structured  singular  value”  (denoted  //)  as  a  means 
of  determining  the  set  of  permissible  structured  uncertainties  for  system  stability  has 
become  a  widely  accepted  tool  in  the  design  of  robust,  multivariable  control  systems. 
Additional  scaling  techniques  such  as  the  nonsimilarity  approach  of  Kouvaritakis  and 
Latchman  have  evolved  to  address  element-  by-element  structured  uncertainties  and, 
in  fact,  both  similarity  and  nonsimilarity  scaling  methods  have  been  shown  to  pro¬ 
duce  necessary  and  sufficient  stability  conditions  in  terms  of  //  for  most  systems  with 
structured  uncertainties  [16,  17]. 

These  singular  value  techniques  have  recently  been  extended  to  the  areas  of  H°° 
and  //-Synthesis:  both  of  which  allow  a  great  deal  of  flexibility  in  the  satisfaction  of 
controller  design  requirements  while  retaining  stability  and  performance  robustness 
properties  [18,  19].  The  main  emphasis  of  this  work  will  therefore  be  directed  to¬ 
wards  the  singular  value  analysis  techniques  including  new  results  that  enhance  their 
application  in  the  analysis  of  robust  multivariable  control  systems. 

1.3  Notation 

The  following  notational  convention  will  apply  unless  otherwise  stated. 

CnXm  :  The  set  of  complex  matrices  with  n  rows  and  m  columns. 
s^nxm  .  The  sej.  0f  reaj  malices  vvith  n  rows  and  m  columns. 

3  :  The  square  root  of  -1. 

Im(a )  :  The  imaginary  portion  of  complex  element  a. 

,Re(a )  :  The  real  portion  of  complex  element  a. 
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arg(a)  :  The  argument  of  complex  element  a. 

|a|  :  The  absolute  value  of  element  a. 

a  :  The  complex  conjugate  of  scalar  a. 

Ah  :  The  complex  conjugate  transpose  of  matrix  A. 

||A||P  :  The  p-norm  of  matrix  A  (p  =  2  unless  noted  otherwise). 

||A|[/r  :  The  Frobenius  norm  of  matrix  A. 

A+  :  Matrix  A  with  elements  replaced  with  their  absolute  values. 

A-1  :  Inverse  of  matrix  A. 

det{A}:  The  determinant  of  matrix  A. 

A,(A)  :  The  zth  eigenvalue  of  matrix  A. 
p(A)  :  The  spectral  radius  of  A. 

<x,-(A)  :  The  zth  singular  value  in  magnitude. 

a(A)  :  The  singular  value  with  maximum  magnitude  (a  —  crj). 

/z(A)  :  The  structured  singular  value  of  matrix  A. 

V  :  The  family  of  diagonal  matrices  with  positive,  real  entries. 

U  :  The  family  of  diagonal  unitary  matrices, 

inf  :  Infimum. 

sup  :  Supremum. 

max  :  Maximum. 

■  :  The  completion  of  a  proof  or  discussion. 


CPU  :  Central  processing  unit. 


CHAPTER  2 

FREQUENCY  DOMAIN  ANALYSIS 
2.1  Transfer  Matrix  Representation 

Any  real  world  system  can  be  characterized  by  a  relationship  between  system 
inputs  and  outputs.  Given  a  system  with  output  y(s)  and  input  u(,$),  the  transfer 
function  G(s)  relates  the  two  in  the  manner 

y{s)  =  G($Ms). 

For  nonscalar  G(s),  the  off-diagonal  elements  of  G(s)  produce  the  system  interac¬ 
tion  that  complicates  multivariable  control  systems.  The  identification  process  to 
determine  a  transfer  matrix  begins  by  injecting  known  inputs  u(s)  into  the  plant  and 
measuring  the  resulting  output  y{s).  An  alternative  approach  begins  with  the  time 
domain  state  space  equations  usually  derived  through  some  knowledge  of  the  plant's 
physics.  The  state  space  representation  may  be  expressed  as 

x(t)  =  Ax{i)  +  Bu(t),  y{t)  =  Cx(t)  Du(t) 

where  A,  B,  C,  and  D  are  real,  possibly  time  varying  matrices.  Through  the  Laplace 
transform,  the  unique  transfer  matrix  representation  may  then  be  written  as 

G(s)  =  C(sI-A)~1B  +  D. 
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Figure  2.2:  Transfer  Matrix  System  Representation. 
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2.2  Characteristic  Loci  and  the  Generalized  Nyquist  Criterion 

Two  assumptions  are  made  at  this  point  concerning  the  transfer  matrices  used 
throughout  this  work.  The  first  assumption  is  that  G(s )  is  square  and  rational. 
The  second  assumption  is  that  G(s )  contains  no  hidden  unstable  modes.  Also,  only 
unity  feedback  will  be  addressed  explicitly  although  any  uncompensated  G(s)  may 
simply  be  rewritten  as  G(s)  =  K(s)G(s)  where  K(s)  is  some  precompensator.  The 
formulation  would  then  continue  with  G(s)  instead  of  (7(s). 

Denote  g(s)  =  — j  as  the  open-loop  transfer  function  of  a  scalar  system  where 
n(s)  and  d(s)  represent  the  numerator  and  denominator  polynomials  respectively. 
Then  <7(5)  may  be  related  to  the  return  difference  transfer  function,  f(s),  by 

/W  =  l+jW  =  ^(±tAi).  (2.1) 

The  term  n(s)  +  d(s)  from  Equation  2.1  actually  corresponds  to  the  closed-loop 
pole  polynomial,  pc(s),  under  unity  feedback  while  d(s)  corresponds  to  the  open-loop 
pole  polynomial,  p0,  of  g(s).  Therefore  f(s)  may  be  rewritten  as  the  ratio  of  the 
closed-loop  to  open-loop  pole  polynomials  or 

/W  =  1+j(s)  =  M£|.  (2.2) 

Application  of  the  principle  of  the  argument  allows  the  development  of  a  graphical 
stability  test  for  Equation  2.2.  Denote  the  open  and  closed-loop  right-half  plane  poles 
as  n0  and  nc  respectively.  Mapping  g{s)  onto  the  classical  Nyquist  D-contour  requires 
that  the  number  of  clockwise  critical  point  (—1  4-  jO)  encirclements  equals  nc  —  n0. 
Closed-loop  stability  implies  that  nc  —  0  so  the  number  of  critical  point  encirclements 


for  a  stable  system  must  be  — n0.  This  result  forms  the  basis  for  the  classical  Nyquist 
stability  criterion. 

Extending  these  results  to  the  multivariable  case,  MacFarlane  and  Postlethwaite 
generalized  Equation  2.2  to  the  form 

tfei{F(s)}  =  det{[I  +  G(s)]}  = 

where  F(s)  is  the  multivariable  return  difference  matrix  and  a  is  a  scalar  constant 
[11].  Rather  than  mapping  g(s )  directly  as  in  the  SISO  case,  the  characteristic  loci, 
defined  as  the  Nyquist  D-contour  map  of  the  frequency  dependent  eigenfunctions, 
gi(s),  are  mapped.  These  eigenfunctions  are  solutions  to  the  equation 

de*{[$r,-(s)/  -  £(s)]}  =  0 

and  hence  form  algebraic  functions  of  the  elements  of  G(s).  Also,  the  gi(s)  are  analytic 
everywhere  except  where  two  or  more  of  the  $r,-(s)  coincide.  At  such  points,  the 
separate  layers  of  the  Riemann  surface  which  forms  the  domain  of  the  <7,(5)  are  simply 
joined  together  forming  closed  curves.  Once  again  the  principle  of  the  argument  may 
be  applied  to  these  closed  curves  to  form  the  basis  of  the  generalized  Nyquist  criterion 
stated  in  Theorem  1.1  [11]. 

Theorem  1.1:  A  multivariable  system  with  open-loop  transfer  matrix  G(s)  con¬ 
taining  only  controllable  and  observable  modes  will  be  stable  under  unity  feedback  if 
and  only  if  the  net  sum  of  counter-clockwise  characteristic  loci  encirclements  of  the 
critical  point  (—1  +  jO)  is  equal  to  the  number  of  right-half  plane  open-loop  poles  of 
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This  important  theorem  allows  for  a  complete  graphical  stability  test  of  multi- 
variable  systems  in  the  frequency  domain  where  no  model  uncertainty  is  present. 
Unfortunately,  SISO  concepts  of  system  robustness  based  on  gain  and  phase  mar¬ 
gins  do  not  extend  to  the  multivariable  case  as  the  eigenfunctions  may  display  large 
shifts  from  relatively  small  perturbations  in  the  elements  of  G(s).  The  next  section 
addresses  the  stability  of  systems  in  the  presence  of  uncertainties. 

2.3  Necessary  and  Sufficient  Stability  Conditions 

The  generalized  Nyquist  criterion,  developed  from  the  characteristic  loci  method, 
provides  both  necessary  and  sufficient  stability  conditions  for  the  nominal  plant  de¬ 
noted  (70(s)  where  s  =  juj  on  the  Nyquist  D  contour.  Stability  is  guaranteed  if  and 
only  if  the  number  of  counter-clockwise  critical  point  encirclements  by  the  character¬ 
istic  loci  equals  the  number  of  unstable  open  loop  poles.  Assuming  that  the  nominal 
plant  itself  is  stable  (either  alone  or  following  some  type  of  control  implementation), 
instability  in  the  perturbed  plant,  Gp(s),  implies  a  change  in  the  number  of  critical 
point  encirclements.  If  G0(s)  and  Gp(s)  have  the  same  number  of  of  open-ioop  unsta¬ 
ble  poles,  then  for  such  a  change  to  occur,  at  least  one  eigenvalue  of  Gp(s )  must  pass 
through  the  critical  point  (—1  -}-  jO).  Restricting  the  nominal  and  perturbed  plants 
to  the  same  number  of  open-loop  poles  requires  uncertainty  matrix  A  to  be  stable 
transfer  matrix.  While  this  requirement  does  limit  allowable  uncertainty  structures  to 
those  with  no  poles  in  the  right-half  plane,  Foo  has  shown  that  unstable  perturbations 
can  generally  be  decomposed  into  two  stable  perturbations  allowing  the  analysis  to 
continue  with  a  possible  increase  in  conservatism  [20]. 
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For  additive  uncertainties,  ihe  necessary  and  sufficient  stability  criterion  for  an 
uncertain  plant  Gv($ )  =  ((70(s)+A(s)),  where  A(s)  represents  a  frequency  dependent 
uncertainty  matrix,  becomes 

\(Gp(s))  #  -1. 

Substituting  back  the  nominal  plant  and  A(s),  the  criterion  may  be  written  as 

A(G0(a)  +  A (s))  ±  -1 

or 

A  {Go{s)  +  A(s)  +  7)^0.  (2.3) 

For  any  matrix  A,  the  determinant  of  A  equals  the  product  of  the  eigenvalues  of  A 
so  Equation  2.3  becomes 

de*{(Go(s)  +  A(s)  +  /)}#0 

or 

det{{G0{s)  +  /}  •  det{{I  +  a(s))-1A(s)  +  ^  0.  (2.4) 

The  nominal  plant  is  known  to  be  stable  and  hence  cannot  have  any  eigenvalues 
passing  through  the  critical  point.  Applying  this  fact,  Equation  2.4  may  be  simplified 
to  the  form 

X(I  +  M(s)A(s))  7^  0  where  M{s)  =  (I  +  G0(s))_1 


or 


A(M(s)A(s))  ±  -1. 


(2.5) 
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Typically,  some  form  of  norm  bound  on  either  the  entire  matrix  (unstructured  uncer¬ 
tainties)  or  individual  elements  (structured  uncertainties)  can  be  estimated,  leaving 
the  phase  information  to  vary  freely.  This  freedom  in  choosing  the  phase  of  A  allows 
Equation  2.5  to  be  written  as  the  necessary  and  sufficient  stability  condition 

sup /3(M(s)A(s))  <  1  (2.6) 

A(S) 

where  p  denotes  the  spectral  radius.  It  should  be  noted  that  while  the  preceding 
argument  addresses  additive  uncertainties,  a  similar  approach  can  be  applied  to  mul¬ 
tiplicative  uncertainties  where  Gp(s)  =  G0(s)(I  +  A(s)).  Multiplying  on  the  right  by 
G0(s)  gives 

Gp(s)  =  G0(s)  +  G0(s)A(s) 
or 

Gp(s)  =  G0{s )  +  A(s)  where  A(s)  =  G0(s)A(s). 

Although  Equation  2.6  provides  necessary  and  sufficient  stability  conditions  for 
uncertain  systems,  it  is  quite  difficult  to  compute  because  the  entire  range  of  A  must 
be  considered.  Furthermore,  the  solution  lacks  convexity  allowing  for  the  possibility 
of  multiple  maxima.  This  unfortunate  situation  greatly  complicates  attempts  to  find 
the  true  supremum  of  /9(M(s)A(s))  over  A(s).  Because  of  the  difficulties  involved 
in  solving  Equation  2.6,  singular  value  techniques  were  developed  to  provide  upper 
bounds  on  permissible  model  uncertainties.  Before  introducing  these  techniques,  a 
mathematical  description  of  several  important  uncertainty  classes  will  be  presented. 
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2.4  Uncertainty  Classes 

So  far,  the  frequency  dependent  uncertainty  matrix,  A(.s),  has  been  used  to 
represent  the  additive  or  multiplicative  uncertainty  present  in  control  system.  No  in¬ 
formation  has  been  provided  on  the  structure  of  A(s)  which  turns  out  to  be  quite  im¬ 
portant  in  the  actual  mathematical  analysis  of  system  stability.  Rather  than  propos¬ 
ing  one  all-encompassing  model  for  A(s)  to  account  for  system  uncertainty,  three 
representations  will  be  presented  with  varying  degrees  of  information  required  for 
each  one.  In  developing  these  three  models,  two  main  objectives  are  observed: 

1.  The  model  should  handle  all  information  available  concerning  the  system’s  ac¬ 
tual  uncertainties.  All  impossible  uncertainty  structures  should  be  excluded  to 
reduce  conservatism  of  the  stability  analysis. 

2.  The  model  should  be  as  simple  as  possible  so  that  the  analysis  process  is  not 
unnecessarily  complex.  Simplified  models  also  encourage  an  interactive  design 
process  so  various  control  designs  may  be  quickly  generated  and  compared. 

Naturally,  these  objectives  produce  conflicting  requirements  and  careful  consideration 
must  be  used  to  select  an  appropriate  model.  In  any  event,  a  conservative  stability 
analysis  is  preferable  over  one  that  is  simple  but  erroneous. 

The  three  uncertainty  models  considered  here  include  unstructured  uncertainties, 
element- by-element  structured  uncertainties,  and  block  diagonal  structured  uncer¬ 
tainties.  In  each  of  these  classes,  the  uncertainty  matrix  A(s)  is  complex  with  seme 
form  of  norm  bound  placed  on  the  magnitude  of  the  matrix  or  matrix  elements.  The 
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simplest  uncertainty  class  to  describe  and  manipulate  concerns  unstructured  uncer¬ 
tainties  which  may  be  mathematically  represented  as 

A*  =  {A  |  ||A||2  <Se  3£+} 

where  ||  •  ||2  represents  the  standard  matrix  ‘2-norm  and  8  a  real  scalar.  (Here,  and 
in  the  remainder  of  this  work,  frequency  dependence  will  be  implied).  From  the 
definition,  it  is  apparent  that  no  information  concerning  the  inner  structure  of  A  is 
accessible.  This  effectively  places  a  SISO  bound  on  a  MIMO  system  greatly  hinder¬ 
ing  efforts  to  reduce  conservatism  if  information  about  the  inner  structure  of  A  is 
available. 

The  second  uncertainty  class,  element-by-element  structured  uncertainties,  pro¬ 
vides  a  rich  source  of  information  about  A.  This  class  is  defined  as 

D,  =  {A  =  {*„}  |  |{y|  <  Pij  S  »+) 

where  a  magnitude  bound  is  placed  on  each  element  of  uncertainty  matrix  A  allowing 
information  on  the  inner  structure  of  A  to  remain  a  part  of  the  stability  analysis 
process.  Class  Dt  provides  a  more  realistic  representation  of  real  world  uncertainty 
than  that  of  the  unstructured  uncertainty  class. 

To  illustrate  the  advantages  of  structured  uncertainties  over  unstructured  uncer¬ 
tainties,  consider  some  process  plant  with  two  independent  control  valves  whose  flow 
rate  is  known  within  ±10%  [21].  As  a  structured  uncertainty,  A  could  be  correctly 
represented  as 


,  M<0.1. 
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Placing  a  norm  bound  of  ||A||2  <  .1  as  required  for  unstructured  uncertainties  allows 
a  number  of  impossible  structures  like 


A  = 

0 

0 

*  1 

,  or  A  =  -  • 

0.1 

0.1 

0.1 

0 

0.1 

0.1 

Since  there  are  two  uncertainties,  not  one  or  four,  these  do  not  correspond  to  any  real¬ 
izable  uncertainty  configuration  so  a  stability  analysis  accounting  for  such  impossible 
structures  would  be  unnecessarily  conservative. 

The  third  uncertainty  class,  block  structured  uncertainties,  is  actually  a  superset 
of  the  first  two  classes.  Mathematically  represented  as 

A  =  (A  I  l|A,v||2  <  %  £  S+} 

the  class  Db  may  have  unstructured  submatrices,  A ,-j,  with  arbitrary  dimension.  For 
the  case  where  size(A{j)  =  1x1,  this  class  reduces  simply  to  Ds.  Likewise,  for 
size(Aij)  =  n  x  n  where  n  is  the  size  of  A,  class  Db  reduces  to  class  Du.  Block 
structured  uncertainties  advanced  from  work  done  by  Doyle  [15]  and  they  have  become 
an  important  area  of  study  due  to  their  quite  general  nature. 

Most  real-world  system  uncertainties  may  be  cast  in  one  of  the  three  classes  dis¬ 
cussed  above.  However,  choosing  the  particular  class  for  a  specific  problem  is  seldom 
straightforward  and  assigning  actual  magnitudes  for  the  elements  of  the  chosen  struc¬ 
ture  can  be  more  difficult  still.  Comparing  the  behavior  of  nominal  plant  G0  with  an 
uncertainty  matrix  A  to  that  of  the  actual  system  provides  insight  into  the  accuracy 
of  the  chosen  model  although  a  large  number  of  candidate  plants  and  uncertainty 
structures  may  satisfactorily  match  experimental  data  gathered  during  the  identifi- 
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cation  process.  Narrowing  down  which  of  these  candidate  model  combinations  best 
approximates  the  actual  system  becomes  a  process  of  model  invalidation  a s  those 
models  that  fail  to  satisfy  some  matching  criteria  are  eliminated.  No  discussion  of 
how  a  designer  should  choose  one  model  over  another  will  be  presented  here  since 
this  is  a  complete  field  unto  itself.  However,  a  description  of  the  problem  by  Smith 
and  Doyle  may  be  found  in  [22]. 

While  most  real-world  systems  may  be  characterized  by  one  or  more  of  the  uncer¬ 
tainty  classes  described  here,  it  must  be  noted  that  other  models  offering  improved 
model  agreement  exist  for  some  specific  problems.  Most  notable  of  these  other  mod¬ 
els  are  those  that  deal  with  combinations  of  both  real  and  complex  uncertainties  and 
those  that  address  repeated  or  dependent  uncertainty  structures.  In  the  three  uncer¬ 
tainty  structures  discussed  above,  unstructured,  element-by-element  structured,  and 
general  block  structured,  some  form  of  bound  was  placed  on  the  magnitude  of  the 
uncertainty.  Such  magnitude  bounds  allow  the  phases  of  the  uncertainty  elements 
or  blocks  to  vary  freely  between  0  and  2t r  so  that  worst  case  situations  can  be  ad¬ 
dressed.  For  many  uncertainty  sources  like  unmodelled  dynamics,  this  phase  freedom 
is  mandatory  to  establish  necessary  and  sufficient  stability  conditions.  However,  if 
some  or  all  elements  of  an  uncertainty  structure  are  known  to  be  strictly  real,  this  al¬ 
lowable  phase  variation  may  provide  for  an  overly  conservative  stability  analysis  [23]. 

A  method  for  treating  strictly  real  uncertainties  was  proposed  by  De  Gaston  and 
Safonov  [24].  This  method  transforms  the  perturbed  plant  Gp  into  a  series  of  convex 
hulls  in  the  complex  plane  and,  through  an  iterative  process,  determines  a  stabil- 
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ity  margin  that  indicates  the  largest  scalar  multiplier  of  A  for  which  stability  is 
guaranteed.  This  approach  provides  necessary  and  sufficient  stability  conditions  but 
only  if  an  infinite  number  of  iterations  are  performed.  Fortunately,  usable  results 
are  generally  obtained  for  a  finite  number  of  iterations  though  the  method  is  still 
computationally  intensive. 

For  uncertainties  consisting  of  combinations  of  real  and  complex  elements,  alter¬ 
native  formulations  of  the  structured  singular  value  have  been  proposed  by  several 
authors  [23,  25,  26].  While  these  approaches  appear  to  reduce  the  conservatism  over 
methods  that  consider  only  complex  uncertainty  structures,  a  number  of  theoretical 
and  computational  challenges  must  be  resolved  before  they  can  be  reliably  applied  to 
engineering  problems. 

In  addition  to  special  uncertainty  models  addressing  real  elements,  the  case  of 
repeated  or  dependent  complex  uncertainty  models  requires  special  consideration. 
This  class  is  actually  a  subset  of  the  other  structured  uncertainty  classes  with  the 
distinction  that  two  or  more  of  the  elements  or  blocks  of  A  are  constrained  such  that 
their  phases  must  always  be  the  same  [27].  Thus,  while  the  phases  may  vary  between 
0  and  27r,  they  may  not  do  so  independently  of  one  another. 

While  the  real  and  repeated  uncertainty  element  models  provide  an  improved 
stability  analysis  for  their  particular  cases,  the  two  structured  uncertainty  classes 
presented  earlier  account  for  a  wide  range  of  important  problems.  Therefore,  only 
their  treatment  will  be  considered  in  the  remainder  of  this  work. 


CHAPTER  3 

SINGULAR  VALUE  TECHNIQUES 


As  discussed  in  Section  2.3,  the  necessary  and  sufficient  stability  criterion 

sup/)(MA)<l  (3.1) 

A 

is  nonconvex  making  it  difficult  if  not  impossible  to  determine  the  range  of  permissi¬ 
ble  As.  Rather  than  giving  up  on  this  approach  for  uncertain  multivariable  systems, 
it  is  possible  to  continue  the  analysis  using  relationships  involving  the  singular  value 
decomposition.  These  relationships  start  out  as  conservative  upper  bounds  but  the 
results  of  Chapter  4  show  that  this  conservatism  can  generally  be  eliminated  com¬ 
pletely.  The  singular  value  decomposition  is  defined  for  any  matrix  A  €  CnXn  as 

A  =  XE  Yh  (3.2) 

where  A'  and  Y  are  unitary  matrices  the  columns  of  which  form  the  left  and  right 
singular  vectors  of  A  respectively.  The  diagonal  matrix  E  contains  the  singular  values 
of  A  in  decreasing  order  of  magnitude.  The  singular  values  and  vectors  of  A  can  be 
determined  in  terms  of  the  eigenvalues  and  eigenvectors  of  the  hermetian  forms  AH A 
and  AA]l  as 

AHAYi  =  afYi 

AAhX{  =  ajXi.  (3.3) 
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Other  useful  relationships  of  the  singular  values  and  corresponding  singular  vectors 
include: 


Y{*Aa  =  nX? 
Ah  Xi  =  aiYi 
X?A  =  aiYf*. 


Definition  3.1  The  2-norm  of  A  6  CnXn,  ||A||2,  is  defined  as 


ii  a  it  _  WAxh 

A  2  —  maX  — r: — rj — 
11  11  <¥*€<>  X  2 


where  ||a:||2  is  the  vector  2-norm  defined  as 


MI2  =  vM2  +  '**+  lXn|2- 


Also,  from  Definition  3.1  comes  the  important  inequality  equation 


114*11*  <  HAH2MI2. 


The  following  theorem  relates  the  2- norm  of  a  matrix  to  its  maximum  singular  value. 
Theorem  3.1:  For  any  matrix  A  €  CnXn 


o(A)  =  ||A||2. 


Proof:  From  Equation  3.2,  matrix  A  may  be  written  as  A  =  XEYH.  The  2-norm 
of  a  matrix  is  invariant  to  unitary  transformations  so 


Wh  =  I|a'ef"||2  =  ||S||3. 
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x. 


From  Definition  3.1,  this  may  be  written  as 

||j4||2  =  ||S||a  =  max  =  max  - - - - - — = — . 

j|®||2  o**ec«  ^p  +  .-.  +  l^p 

Choosing  x  =  ei,  the  first  standard  basis  vector  gives  ^H^2  =  (T\{A)  while  any  other 


choice  for  x  gives  <  a\(A).  Therefore,  ||A||2  =  cri(A)  =  &{A). 


¥h 

Using  this  result,  the  following  lemma  relates  v(A)  to  p(A). 
Lemma  3.1  For  any  matrices  A,  B  6  Cnxn, 


p(AB )  <  a(AB). 


(3.7) 


Proof:  From  the  eigenvalue/eigenvector  equation  we  have 


ABWi  =  XiWi. 


where  A,-  is  an  eigenvalue  of  ( AB )  and  Wi  the  corresponding  eigenvector.  Taking  the 
2-norm  of  both  sides  gives 

\\ABWi\U  =  \\XiWih  =  |A*|||Wi||a. 

Equation  3.6  allows  this  to  be  written  as  an  inequality 

WABUmy  >  \\ABW;h  =  MIIK'-flU- 

Applying  Theorem  3.1  along  with  the  definition  of  the  spectral  radius  gives 

a(AB)>  p(AB).  a 

Finally,  the  2-norm  of  a  matrix  product  is  related  to  the  product  of  the  2-norms 


as  shown  in  the  following  lemma. 
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Lemma  3.2  For  A,B$  Cnxn 


a(AB)  <  a(A)a{B). 


(3.8) 


Proof:  Again,  from  Definition 


\\ABxh  .  „nr.  ||A||2||B*||a 

n  r —  <  max  - n — n - 


II  A  70! I  _  _  ^  _ 

\\Atf  \h  —  max  - : —  n  max  r;  ^ 

o?xeCn  j|a:|j2  o^zec"  ||£||2 


„  ,11  \\Bx\U 

A  2  max  — j; — r; 

"  11  0,4*6 C"  x  L 


\\AB\\2  <  \\A\U\\Bh- 


Replacing  A  and  B  with  M  and  A  respectively  in  Equations  3.7  and  3.S,  the  following 
expression  must  hold: 


p(M A)  <a(M A)  <  a(M)a( A). 


From  these  relationships,  the  necessary  and  sufficient  stability  criterion  of  Equation 
3.1  can  be  written  as  the  following  sufficient  stability  conditions 

supa(jV/)o(A)  <  1  (3.9) 

A 

and 

supa(MA)  <  1.  (3.10) 

A 

Since  Equations  3.9  and  3.10  only  guarantee  sufficient  stability  conditions  for 
structured  uncertainties,  additional  manipulations  are  required  to  reduce  conser¬ 
vatism  with  the  intent  of  regaining  the  necessary  stability  condition.  Two  techniques 
that  address  this  problem  are  nonsimilarity  scaling  introduced  by  Kouvaritakis  and 
Latchman  [16]  and  similarity  scaling  advanced  by  Doyle  [15].  Both  techniques  rely 
on  the  fact  that  while  the  eigenvalues  (and  hence  the  spectral  radius)  of  a  matrix  are 
unaffected  by  similarity  transformations,  the  singular  values  are  not  necessarily 
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preserved  and  may;  in  fact,  be  reduced.  The  nonsimilarity  scaling  technique  will  be 
discussed  first  with  the  analysis  limited  to  element  by  element  bounded  uncertainties. 

3.2  Structured  Uncertainties  with  Nonsimilarity  Scaling 

The  nonsimilarity  scaling  formulation  starts  with  the  necessary  and  sufficient 
stability  criterion  of  Equation  3.1.  Introducing  positive,  diagonal  matrices  R  and  L 
to  this  expression  gives  the  equivalent  stability  condition 

supp(MA)  =  supp(R-'ML-'LAR)  <  1. 

A  A 

This  may  be  rewritten  in  the  form  of  Equation  3.9  giving  the  sufficient  stability 
criterion 

snpa(R-xML-l)a{LAR)  <  1.  (3.11) 

A 

Equation  3.11  no  longer  conforms  to  the  similarity  transformation  structure:  hence 
the  name  “nonsimilarity  scaling.”  However,  the  free  elements  of  R  and  L  still  allow 
the  conservatism  gap  between  the  maximum  singular  value  upper  bound  and  the 
spectral  radius  lower  bound  to  be  reduced.  Also,  the  choice  of  positive  values  for 
the  elements  of  R  and  L  allows  a  simplification  based  on  the  following  lemmas  and 
Theorem  3.2. 

Lemma  3.3:  For  all  A  €  Cnxn  and  B  €  5£nXn  where  by  >  0  and  for  all 

i,j ,  with  x+  defined  as  (|a;i|,  •  •  • ,  |xn|)r) 


||Aa;||2  <  IIFte+lj?  for  all  x  €  Cn. 


(3.12) 
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Proof:  From  the  definition  of  a  vector  2-norm 


‘  +  Oln-'Cnl2  +  •  •  ‘  +  l^nl^l  +  •  ‘  +  Gnn^nl2 

||5x+||2  =  \/|  [&lt|2l|  +  *  •  •  4-  &ln|£n|]  I2 +  •••  +  !  [^nlkl|  +  ’ - f*^nn|^n|]  |2- 

The  second  equation  contains  only  nonnegative  numbers  so  the  sum  of  the  individual 
terms  cannot  decrease.  The  first  equation,  however,  contains  elements  that  may  be 
negative  so  the  sum  of  the  individual  terms  may  decrease.  Since  the  elements  of  B 
are  greater  than  or  equal  to  the  corresponding  elements  of  A ,  the  lemma  must 


hold. 


Lemma  3.4  Using  the  definition  of  x+  from  Lemma  3.3, 


|x||2  =  i|z+||2  for  all  x  €  Cn. 


(3.13) 


Proof:  Again,  from  the  definition  of  the  vector  2-norm 


Mb  =  vM2  +  •  •  •  +  kn|2  =  llz+lb- 


Lemma  3.5  For  A  and  B  defined  as  in  Lemma  3.3, 


U*h 


<  j|i?||2  for  all  0^x6  Cn. 


(3.14) 


Proof:  From  Definition  3.1, 

II  oil  _  WBxh 

\\B  2  —  max  - ; r;  ^ — . 

Since  all  elements  of  B  are  nonnegative,  this  may  be  combined  with  Lemmas  3.3  and 


I,  PH  WBx+h  ^  WMU 

\\B  2  —  max  - r; —  -*tt — r; — . 

||*+||2  || a; || 2 


3.4  to  give 
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Theorem  3.2:  Given  A  €  CnXn,  positive,  diagonal  matrices  L,R  €  RnXn  and 
P  €  3?nXn  where  p,-j  >  0  and  |£y|  <  pij  for  all  i,j, 

a(LAR)  <  a(LPR).  (3.15) 


Proof:  This  proof  follows  directly  from  Lemmas  3.3,  3.4,  and  3.5  with  A  replaced 
by  LAR  and  B  replaced  by  LPR  so  that 


\\LAR\\2 


max 

o^xecn 


IIH 

11*11* 


<  max 
—  o^x+ecn 


\\LPR^h 

n*+n* 


WLPRh-  ■ 


Theorem  3.2  and  the  definition  of  element-by-element  structured  uncertainties 
from  Section  2.4  allow  the  direct  substitution  of  P  for  A  in  Equation  3.11. 

Substituting  P  for  A  would  appear  to  add  additional  conservatism  to  Equation 
3.11.  However,  Lemma  3.6  shows  that  this  is  not  the  case. 

Lemma  3.6  Given  LAR  and  LPR  defined  as  in  Theorem  3.2,  the  following  equality 
must  hold: 


snpa{LAR)  =  a{LPR).  (3.16) 

A 

Proof:  The  freedom  of  the  phases  of  each  element  of  A  allows  6,-j  = 
for  all  i,j.  Therefore,  by  individually  adjusting  the  phases  of  each  element  of  A,  the 
equality  of  Equation  3.16  is  established.  ■ 

Since  P  always  represents  a  realizable  variation  of  A,  there  is  actually  no  added 
conservatism  by  substituting  P  for  A.  Therefore  the  sufficient  stability  condition  of 
Equation  3.11  becomes 


inf  a(R~x ML~l)a(LPR)  <  1 

L/,H 


(3.17) 
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over  all  design  frequencies.  This  infimization  over  L  and  R  requires  only  2(n  —  1) 
optimization  variables  because  one  diagonal  element  of  both  L  and  R  may  be  held 
constant  without  loss  of  generality.  For  the  case  of  distinct  Equation 

3.17  has  been  shown  to  converge  to  the  necessary  and  sufficient  condition  of  Equation 
3.1.  This  optimal  condition  is  characterized  by  the  major  principal  direction  align¬ 
ment  (MPDA)  theorem  proposed  by  Kouvaritakis  and  Latchman  [17].  Due  to  the 
importance  of  this  theorem  it  will  be  discussed  in  detail  in  Chapter  4. 

3.3  Structured  Uncertainties  with  Similarity  Scaling 

The  previous  section  outlined  the  nonsimilarity  scaling  techii.  que  which  applies 
to  systems  with  element-by-element  structured  uncertainties.  A  second  approach  to 
solving  Equation  3.1  through  singular  value  techniques  involves  diagonal  similarity 
scaling  where  only  one  scaling  matrix  is  used.  Advanced  by  Doyle  [15],  this  technique 
requires  ths'  the  uncertainty  matrix  A  be  diagonal  with  norm  bounds  on  the  diagonal 
elements  or  blocks.  An  important  advantage  of  diagonal  similarity  scaling  is  its 
ability  to  address  the  general  block  structured  uncertainties  discussed  in  Section  2.4. 
While  the  requirement  for  diagonalized  uncertainties  appears  to  severely  restrict  the 
applicability  of  this  technique,  it  is  possible  to  diagonalize  any  uncertainty  matrix 
through  the  use  of  simple,  eigenvalue  preserving  transformations.  Properties  of  these 
transformations  will  be  discussed  in  Section  3.5. 

For  diagonal  uncertainty  matrix  A,*,  diagonal  scaling  matrix  D  may  be  introduced 
allowing  the  necessary  and  sufficient  stability  conditions  of  Equation  3.1  to  be  written 
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as 

sup  p(M Ad)  =  sup  p(DMAdD~1)  <  1  (3.18) 

Ad  A  d 

and  the  sufficient  condition  of  Equation  3.10  as 

supaiDMAdD'1)  <  1.  (3.19) 

Ad 

Taking  advantage  of  the  diagonal  nature  of  permits  a  decomposition  of  the  form 

=  PdUd 

where  pdti  >  !£<*,.,  |  for  all  i  and  Ud  is  a  diagonal  unitary  matrix.  This  allows  Pd  to 
retain  the  magnitude  information  and  Ud  the  phase  information  of  Ad.  Substituting 
PdUd  for  Ad  in  Equation  3.19  gives  the  sufficient  stability  condition 

sup p(MAS)  <  sup a(DMPdUdD~l)  <  1. 

Ad  ud 

Noting  that  diagonal  matrices  commute  with  each  other,  the  positions  of  Ud  and  D~x 
may  be  exchanged  making  the  condition 

sup/3(MA,f)  <  sup a(DMPdD~xUd)  <  1.  (3.20) 

Ad  Vd 

However,  the  2-norm,  and  hence  the  maximum  singular  value,  of  a  matrix  is  invariant 
to  unitary  transformations  removing  the  dependence  on  Ud •  Combining  M  and  Pd 
into  Ma  =  MPd ,  the  condition  of  Equation  3.20  becomes 

sup p(M Ad)  <  inf a(DMaD~x)  <  1.  (3.21) 

Ad  & 

Since  the  spectral  radius  of  MaUd  is  always  bounded  from  above  by  the  maximum 
singular  value  of  DMaD~l ,  the  free  elements  of  D  may  be  used  to  reduce  the  conser¬ 
vatism  gap  between  the  maximum  singular  value  upper  bound  and  the  spectral  radius 
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lower  bound  of  Equation  3.21.  For  the  general  case  where  a  stationary  point  occurs 
at  the  “inf,”  the  resulting  optimal  value  of  &(DMaD~l)  is  known  as  the  “structured 
singular  value”  of  Ma  (denoted 

The  original  definition  of  the  structured  singular  value  had  the  form  [15] 


KMa)  = 


0  if  no  A  solves  det{[I  +  M A]}  =  0 
[  else  (minA  {^(A)  |  det{[I  +  MA]}  =  0})-1  J 
While  this  expression  is  rather  unwieldy,  a  more  useful  alternative  formulation  is  given 


by 


fi(Ma)  =  snpp(MaUd)  <  inf  cr(DMaD  l).  (3.22) 

The  right-hand  side  of  Equation  3.22  has  been  proven  convex  (with  respect  to  scaling 
matrix  eD )  [28,  29]  and  the  MPDA  formulation  for  similarity  scaling  shows  that  the 
inequality  becomes  an  equality  at  the  “inf”  for  stationary  points  of  a(DMaD~l)  [17], 
Chapter  4  reviews  conditions  required  for  the  optimal  solution  to  Equation  3.22. 
However,  an  interesting  suboptimal  solution  was  proposed  by  Safonov  involving  the 
Frobenius  norm  minimization  of  DMaD~l  [14].  Since  the  Frobenius  norm  of  a 
matrix  provides  an  upper  bound  on  the  2— norm  (and  hence  the  maximum  singular 
value)  of  the  matrix,  reducing  the  Frobenius  norm- of  DMaD~x  generally  serves  to 
reduce  a(DMaD~l)  as  well.  An  iterative  algorithm  introduced  by  Osborne  performs 
such  a  reduction  by  choosing  elements  of  D  that  equalize  the  row  and  column  sums  of 
DMaD~x  [30].  While  this  procedure  produces  a  conservative  upper  bound  for 
it  is  numerically  inexpensive  since  no  singular  value  decompositions  are  required. 
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3.4  Block  Structured  Uncertainties  with  Similarity  Scaling 

As  discussed  in  Section  2.4,  the  family  of  block  structured  uncertainties  represents 
the  most  general  class  since  it  is  a  superset  of  both  the  unstructured  and  element- 
by-element  structured  uncertainty  classes.  Let  At  represent  a  block  diagonal  matrix 
containing  both  scalar  and  nonscalar  uncertainty  elements.  As  in  Section  3.3,  this 
can  be  decomposed  giving 

At  =  PbUb 

where  Pb  contains  the  magnitude  bounds  and  Ub  the  phase  information  of  Ab.  An 
important  difference  between  this  decomposition  and  that  of  Section  3.3  is  the  re¬ 
quirement  for  Pb  and  Ub  to  reflect  the  block  structure  of  Ab.  For  this  configuration, 
the  diagonal  scaling  matrix  Db  must  also  reflect  the  block  structure  requiring  repeated 
elements  of  Db  as  necessary  to  correspond  to  the  size  of  the  individual  blocks  in  A*,. 
With  the  exception  of  maintaining  the  block  structure,  this  formulation  is  identical 
to  that  for  element-by-element  uncertainties  and,  in  fact,  the  block  form  of  Equation 
3.22  simply  becomes 

n(Mb)  =  sup  p(MbUb)  <  inf  a{DbMbDbl)  (3.23) 

Ub  Db 

where  Mb  =  MPb. 


3.5  Diagonalizing  Transformations 

As  mentioned  earlier,  it  is  always  possible  to  reconfigure  any  structured  A  into 
diagonalized  form.  First  consider  uncertainty  class  Ds  described  in  Section  2.4  where 
A  €  Cnxn  contains  n2  complex  elements  Si  to  6n 2.  Next  define  A d  €  C”2*"2  = 
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diag[8i, . . . ,  fo],  It  is  always  possible  to  relate  Aj  to  A  using  transformation  matrices 
E\  €  3JnXn2  and  E2  €.  3?n2xn  containing  only  l's  and  0's  such  that  E\AdE 2  =  A.  From 
this  relationship,  the  necessary  and  sufficient  stability  conditions  of  Equation  3.1  may 
be  written  as 

sup  piMEtAdEi)  <  1. 

A  d 

At  this  point,  some  means  of  recovering  the  simplifying  properties  of  diagonal  matrices 
required  by  Equation  3.20  is  needed.  The  following  theorem  relates  the  eigenvalues 
(and  hence  the  spectral  radius)  of  matrix  products  ( AB )  and  (BA). 

Theorem  3.3:  Given  A  €  CnXm  and  B  6  Cmxn, 

Xi(AB)  =  Xi(BA)  i=  l,...,n. 

Proof:  Define  A;  and  A_j  as  the  eigenvalues  of  (AB)  and  (BA)  respectively.  Then 

(AB)Wi  =  XiWi  i  =  1, . . .  ,n 
(BA)Y;  =  XjYj  j  =  l,...,m 

where  Wi  €  CnXl  and  Yj  €  CmXl  form  the  corresponding  eigenvectors.  Multiply  the 
first  equation  by  B  on  the  left  giving 

(BAB)W{  =  XiBWi. 

Vector  BW{  €  Cmxl  must  be  an  eigenvector  corresponding  to  the  zth  eigenvalue  of 
(BA)  as  well  as  the  ith  eigenvalue  of  (AB).  Since  the  eigenvalues  of  a  matrix  are 
unique,  A i(AB)  =  X{(BA)  for  i  =  1  ,...,n.  ■ 
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Using  this  result,  replace  A  by  ME\Pd  and  B  by  E2  so  the  spectral  radius  equa¬ 
tions  become 

p(MA)  =  p(E2ME1  Ad). 

Next,  define  Ma  =  E2ME\  and  the  necessary  and  sufficient  stability  condition  of 
Equation  3.1  may  be  written 

sup  p(MaAd)  <  1. 

Starting  at  this  point,  the  analysis  of  Section  3.3  can  then  continue  as  before. 

While  this  diagonalizing  transformation  is  straightforward,  it  has  the  unfortunate 
consequence  of  increasing  the  system  size  from  n  x  n  to  as  much  as  n2  x  rr  with  a 
resultant  increase  in  computational  requirements  to  compute  the  maximum  singular 
values.  Also,  the  number  of  optimization  variables  required  to  infimize  Equation  3.22 
can  increase  to  n2  —  1  versus  2(n  —  1)  for  nonsimilarity  scaling.  (As  with  matrices  L 
and  R  of  Equation  3.17,  a  single  element  of  D  may  be  arbitrarily  fixed  allowing  the 
elimination  of  one  optimization  variable).  Even  with  these  disadvantages,  the  ability 
to  manage  block  structured  uncertainties  makes  similarity  scaling  an  invaluable  tool 
in  the  analysis  of  uncertain  systems. 


CHAPTER  4 

THE  MAJOR  PRINCIPAL  DIRECTION  ALIGNMENT  PROPERTY 


4.1  Application  to  Similarity  Scaling 

The  use  of  singular  value  techniques  in  the  form  of  nonsimilarity  and  similarity 
scaling  has  the  disadvantage  of  initially  reducing  the  necessary  and  sufficient  stabil¬ 
ity  conditions  of  Equation  3.1  to  simply  sufficient  conditions.  For  convenience,  the 
relevant  equations  are  repeated. 

supp(MA)  <  inf  "a(R~l  ML~l  )a(LPR)  Nonsimilarity  Scaling  (4.1) 

•A  £/,H 

sup p(M A)  <  \\\io(DMaD~x)  Similarity  Scaling.  (4.2) 

a  D 

Eliminating  conservatism  in  these  singular  value  formulations  requires  establishing 
conditions  for  which  the  inequalities  hold  with  equality.  This  section  reveals  that 
achieving  equality  in  Equations  4.1  and  4.2  is  simply  a  matter  of  invoking  the  MPDA 
property  [17].  Although  MPDA  conditions  have  been  shown  to  exist  for  both  non¬ 
similarity  and  similarity  formulations,  this  discussion  will  focus  only  the  similarity 

scaling  techniques. 

Definition  4.1:  The  major  right  principal  direction,  Vi,  and  the  major  left  prin¬ 
cipal  direction,  X\,  of  matrix  A  are  the  right  and  left  singular  vectors  of  matrix  A 
corresponding  to  a(A). 
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Theorem  4.1:  For  any  matrix  A  €  CnXn 

p{A)  =a{A) 

if  and  only  if  X\  and  Y\  are  aligned  within  a  scaling  factor  e3°  such  that 

Yi  =  e*Xx.  (4.3) 

Proof:  For  notational  simplicity,  denote  a(A)  by  a  and  p(A)  by  p.  Vectors  X\  and 
Vi  from  the  singular  value  decomposition  must  be  unique  with  respect  to  one  another 
within  a  scaling  factor  ej6.  Multiplying  both  sides  of  Equation  4.3  on  the  left  by  A 
gives 

AVI  =  e*  AX 

Also,  the  relationships  of  Section  3.1  show  that  AYr  =  aX i  so  this  can  be  rewritten 
as 

AXi  =  ae~]dX1 

indicating  that  ae~j0  actually  corresponds  to  an  eigenvalue  of  A.  Since  eigenvalues 
of  a  matrix  are  always  magnitude  bounded  by  the  maximum  singular  value  of  the 
matrix,  sufficiency  of  the  theorem  is  established. 

Establishing  the  necessity  of  the  theorem  starts  with  the  assumption  that  a  =  p 
so  some  eigenvector  Z\  exists  where 

AZt  =  e*aZx  (4.4) 

and 


Z?Ah  =  e'3*aZ({. 


(4.5) 
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Multiplying  Equation  4.4  on  the  left  by  Equation  4.5  and  noting  the  AHA  =  YYZY 
provides  the  following  expression: 

Z(IAHAZl  Z(1YZ2YHZl  _2 
Z?Zn  ~  Z(lYYHZi  ~  ^  ' 

This  may  be  simplified  by  defining  a  new  vector  W\  =  Y^ Z\  to  give 


w?r?wx  _2 

Wj'W,  ~ a  ' 


(4.6) 


Equation  4.6  can  only  be  satisfied  if  W\  =  eJ^ei  where  e\  is  the  first  standard  basis 
vector.  Therefore,  Z\  =  YW\  —  e^Ye i  =  e^Yi.  Substituting  back  into  Equation  4.4 
then  gives 

Ae*Yx  =  Tfd*Xx  =  e*Se*Yx 

or 

Xx  =  e*Yx 

establishing  the  necessity  of  the  theorem.  ■ 

Corollary  4.1:  For  the  case  of  repeated  singular  values  of  A  €  Cnxn  where  q  denotes 
the  multiplicity  of  cri, 

p{A)  =  ffx{A)t...tffq{A) 


if  and  only  if  the  subspaces  spanned  by  X\, . . . ,  Xq  and  Y\, . . . ,  Yq  are  aligned  within 
a  scaling  factor  ej9. 

Proof:  From  Equation  3.2  the  singular  vectors  of  matrix  A  with  a  repeated  maxi- 
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mum  singular  value  are  defined  as 

AHAYX  =  a\Yx  AAHXx  =  <t\Xx 

:  and  : 

AHAYq  =  a\Yq  AAHXq  =  a\Xq 

Therefore,  linear  combinations  of  the  first  q  columns  of  X  or  Y  form  legitimate 
solutions  for  X\  and  Y\  respectively.  Combining  these  solutions  with  Theorem  4.1 
completes  the  proof.  ■ 

The  MPDA  property  therefore  provides  an  analytical  test  for  equality  in  Equation 
3.21  through  which  the  necessary  and  sufficient  stability  conditions  of  Equation  3.1 
apply.  This  allows  the  infimization  of  Equation  3.22  to  be  performed  while  providing 
a  direct  test  as  to  whether  the  structured  singular  value,  ( fi{Ma )),  has  been  achieved. 
It  would  be  desirable  if  this  infimization  was  somehow  guaranteed  to  invoke  MPDA 
and  hence  attain  n(Ma).  Fortunately,  such  a  guarantee  is  provided  by  the  following 
theorem. 

Theorem  4.2:  The  infimizing  solution  with  respect  to  diagonal  scaling  matrix  D 
of  stability  criterion 

inf  a(DMaD~l)  <  1  (4.7) 

is  both  necessary  and  sufficient  provided  that  =  0  at  the  “inf.” 

Proof:  The  sufficiency  of  the  theorem  was  shown  in  Equation  3.21.  Necessity  is 
established  as  long  as  MPDA  can  be  achieved.  For  simple  7r(DMaD~l),  the  optimal 
solution  of  Equation  4.7  occurs  when 


^(W)  =  o 
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indicating  a  stationary  point.  As  mentioned  in  Section  3.3,  the  left-hand  side  of 
Equation  4.7  is  convex  with  respect  to  scaling  matrix  eD  so  any  local  minimum  is  the 
global  minimum.  Therefore  the  only  stationary  point  must  correspond  to  the  optimal 
solution.  By  direct  differentiation,  the  stationarity  condition  becomes 

|^(&)  =  [m»bA  -  y,  =  o  (4.8) 

where  Ma  =  DMaD~l ,  and  E,  is  a  square  matrix  containing  a  “1”  in  the  nth  position 
and  “Os”  elsewhere.  (Note:  the  actual  differentiation  is  omitted  here  but  the  complete 
differentiation  of  a  similar  function  appears  in  Section  5.5). 

Examining  conditions  at  which  Equation  4.8  equals  zero  reveals  the  requirement 
that 

|®i«|  =  Js/if  |  i  =  (4*9) 

which  is  in  fact  the  MPDA  requirement  that  the  magnitudes  of  the  elements  of  A'i 
and  Yi  be  equal. 

Completion  of  the  proof  from  this  p.;int  requires  that  some  Ud  exists  such  that 
the  phase  alignment  of  X\  and  Y\  occurs.  The  existence  of  such  a  Ud  then  guarantees 
that  p(MaUd )  =  a{DMaD~xUd )  =  ^(M0).  From  the  singular  value  decomposition, 
the  principal  singular  vectors  are  related  as 

(DMaD~1)Yi  =■  aXi 
Xl!{DMaD~l)  =  aYf. 


Introducing  a  diagonal  unitary  matrix,  Ud,  allows  these  equations  to  be  rewritten  as 
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(DMaD~lUd)U^Y\  =  aXx 
X”{DMaD~xUd )  =  aYfUd.  (4.10) 

From  Equation  4.10  it  is  apparent  that  the  diagonal  elements  of  Ud  may  be  used  to 
alter  the  phases  of  A"i  and  Y\  while  continuing  to  satisfy  the  MPDA  requirement  that 
the  magnitudes  of  the  corresponding  elements  of  Ai  and  Y\  be  equal.  The  ability  to 
achieve  independent  phase  alignment  of  X\  and  Y\  by  Ud  completes  the  proof.  ■ 

The  result  of  this  theorem  is  that  optimization  of  Equation  4.7  has  the  effect 
of  directly  invoking  MPDA  at  the  “inf”  for  simple  «f,  guaranteeing  the  necessary 
and  sufficient  stability  conditions.  Cases  with  nonsimple  a  (known  as  “cusps”)  do 
not  necessarily  contain  a  stationary  point  at  the  “inf”  and  must  be  analyzed  using 
different  techniques.  A  more  complete  discussion  of  cusping  systems  appears  in 
Chapter  8. 


4.2  Block  Structured  Uncertainties 

By  following  the  MPDA  development  for  scalar  uncertainties,  a  similar  formu¬ 
lation  results  for  general  block  structured  uncertainties.  The  main  difference  is  the 
requirement  of  maintaining  the  individual  block  structures  rather  than  considering 
each  element  of  A  individually.  As  discussed  in  Section  3.4,  this  block  structure  must 
be  reflected  in  the  scaling  matrices  as  well.  The  optimal  solution  is  then  characterized 
by  the  equalization  of  the  2-norms  in  the  corresponding  blocks  of  the  left  and  right 
principal  vectors  X\,  and  Y\  respectively.  The  same  guarantee  of  achieving  equality 


38 


at  the  optimal  solution  of  Equation  3.23  for  simple  cr(j applies  as  in  the 
element-by-element  structured  uncertainty  case. 


4.3  Examples 

The  following  examples  illustrate  the  methods  described  in  the  previous  sec¬ 
tions. 

Example  4.1:  Let  randomly  generated  matrices  M  €  C3x3  be  chosen  such  that 

'  .063  +  .156;  -.322  +  .480;  .585  +  .526;  ' 

M  =  .726  -.514;  -.323 -.344;  .150 -.469;  . 

_  .189  -  .463;  .053  -  .577;  -.236  -  .056; 

Next  chose  a  random  element- by-element  structured  uncertainty  matrix  A  €  C3x3 

with  corresponding  P  6  3?3x3  written  as 

‘  |ft|  \S2\  14,1  [*  2.99  3.03  0.54  ' 

N  |4|  |4|  <  1.65  1.87  3.41  =  P. 

.  M  I4|  |4|  J  L  1.90  1.20  1.37  _ 

Formulated  as  a  nonsimilarity  scaling  problem, 

<  inf  a(Rr'ML-l)a{LPR) 

the  diagonal  elements  of  the  optimal  R  and  L  scaling  matrices  are  found  to  be 

ri  =  1  1  [  /i  =  1 

r2  =  1.151  ,  L  =  l2  =  0.944 

r3  =  1.174  J  /3  =  1.203 

The  maximum  singular  values  do  not  repeat  for  this  example  so 

n(Ma)  =  a{R-lML~1)a(LPR)  =  8.25. 

This  same  problem  may  be  recast  into  a  similarity  scaling  form  by  first  selecting 


transformation  matrices  E\  and  E2  as 
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Expanding  M  and  P  as  described  in  Section  3.3,  the  optimal  solution  of  Equation 

3.22  requires  scaling  matrix  D  with  diagonal  elements 

di  =  1 

d2  =  0.906 
d3  =  0.406 
<4  =  0.719 

D  =  d5  =  0.688  . 
d6  =  0.988 
d7  =  0.783 
ds  =  0.560 
dg  =  0.636  . 

As  before,  fi(Ma)  =  a(DMaD~l )  =  8.25.  The  absolute  values  of  the  left  and  right 
principal  directions  for  this  optimal  solution  are 


’  1  1  T  1 

1.119  1.119 

0.444  0.444 

0.719  0.719 

|  AT  |  =  0.850  =[^1=  0.850 

1.080  1.080 

0.783  0.783 

0.691  0.691 

.  0.695  J  0.695 


As  required  by  the  MPDA  theory,  |Ai|  =  |Yi|  at  the  “inf”  guaranteeing  that  some 
Ud  exists  such  that  p(MaUd )  =  a{DMaD~l)  =  p(Ma). 

Example  4.2:  Using  the  M  matrix  from  Example  4.1,  define  a  block  structured 


uncertainty  matrix  as 
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A  = 


<54 

<5S 


$3 

A  6 


whore  through  65  are  scalar  uncertainties  such  that  |<5,|  <  pi  and  Aj,  is  a  2  x  2  block 


with  <r(Ai)  <  pt,.  Block  Aj  may  be  represented  as 


A4  =  PbUb 


where  Pb  is  a  2  x  2  matrix  with  cr(Pb)  =  Pb  and  Ub  is  a  2  x  2  unitary  matrix  containing 


the  phase  information  of  A b-  We  may  now  define  a  matrix  Pa  containing  randomly 


generated  bounds  on  the  and  Aj  elements  as 


Pa  = 


'  2.99  3.03  0.54  ' 
1.65 

1.90  Pb 


with  a(Pb)  =  1.87.  It  is  easily  shown  that  the  worst  case  structure  for  Pb  in  terms  of 


fi(Mb)  occurs  when 


P6  = 


1.87  0 

0  1.87 


allowing  a  P  matrix  for  this  case  to  be  written  as 


P  = 


2.99  3.03  0.54 
1.65  1.87  0 

1.90  0  1.87 


The  2x2  block  structure  constrains  the  last  two  terms  of  scaling  matrix  Db  to  be 


identical  thus  eliminating  one  degree  of  freedom  from  the  optimization.  Also,  the  two 
zero  elements  of  P  allow  the  diagonalized  matrix  Pd  (and  hence  Mb)  to  be  of  size 
7x7  rather  than  9x9. 


Choose 
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corresponding  to  Pi  —  diag[ 2.99,  3.03,  0.54,  1.65,  1.9,  1.87,  1.87]  so  that  P  = 


EiPdE2-  Expanded  matrix  Mb  may  then  be  formed  as  Mb  =  E2MExPi .  Following 


the  infimization  of  a(DbMbDb  *),  the  optimizing  Db  and  the  absolute  values  of  the 


elements  of  X\  and  Ft  are  found  to  be 


'  dx  =  1 

'  0.427  ‘ 

'  0.427  ' 

d2  •■=  0.82 

0.527 

0.527 

dz  =  0.41 

0.187 

0.187 

=  0.63 

,  i*,i  = 

0.269 

0.269 

ds  =  0.87 

0.373 

0.373 

d6  =  0.69 

0.442 

0.279 

d7  =  0.69 

0.313 

0.465 

with  fi(Mb)  =  6.5.  Note  that  the  first  five  terms  of  |A''i|  and  |yi|  are  equal  as  in  the 
case  of  scalar  uncertainty  blocks  under  MPDA  conditions.  However,  the  presence  of 
the  2  x  2  block  causes  the  last  two  terms  of  IXil  and  |yi|  to  be  related  in  the  following 


manner: 


1^61 12  +  |z7i|2  =  I2/61 12  +  I2/71 12  =  0.2937. 


As  described  in  Section  4.2,  this  equality  between  norms  of  the  corresponding  blocks  of 
X\  and  Y\  occurs  whenever  MPDA  is  established  with  block  structured  uncertainties 
guaranteeing  the  existence  of  some  optimal  (jb  such  that  p(MbUb)  =  a(DbMbDbl)  = 
fi(Mb)  [31]. 

The  material  presented  so  far  provides  a  motivation  for  computing  the  structured 
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singular  value  using  singular  value  scaling  techniques.  However,  even  with  the  ex¬ 
amples,  very  little  insight  can  be  gained  into  the  computational  issues  involved  with 
determining  \i .  In  fact,  although  the  structured  singular  value  concept  is  currently 
being  applied  to  the  design  of  large  engineering  problems  [23],  a  number  of  difficul¬ 
ties  continue  to  hinder  its  application.  The  next  chapter  shows  that  knowledge  of  the 
uncertainty  structure  can  lead  to  a  reduction  in  the  number  of  optimization  variables 
required  to  compute  n  using  similarity  scaling. 


CHAPTER  5 

REDUCTION  IN  THE  NUMBER  OF  OPTIMIZATION  VARIABLES  REQUIRED 
TO  COMPUTE  THE  STRUCTURED  SINGULAR  VALUE 

5.1  Reduction  of  Optimization  Variables 

As  mentioned  earlier,  similarity  scaling  techniques  have  the  capability  of  handling 
general  block  structured  uncertainties.  While  this  advantage  justifies  the  use  of  simi¬ 
larity  scaling  for  systems  with  these  uncertaint  structures,  the  requirement  that  the 
uncertainty  blocks  be  diagonalized  results  in  a  possibly  substantial  increase  in  overall 
system  size.  For  scalar  blocks  this  expansion  transforms  an  n  x  n  system  to  an  n2  x  n2 
system.  Not  only  does  this  extended  system  size  increase  the  floating  point  operation 
(flop)  count  for  each  singular  value  decomposition,  but,  as  discussed  in  Section  3.5, 
it  also  increases  the  number  of  optimization  variables  from  2 (n  —  1)  for  nonsimilarity 
scaling  to  n2  —  1  for  similarity  scaling.  Since  M,  A  and  hence  fi(Ma)  are  frequency 
dependent  functions,  a  frequency  sweep  of  n  must  be  performed  to  determine  the 
worst  case  condition.  Depending  on  the  operating  frequency  range  of  the  control 
system  under  investigation,  this  frequency  sweep  may  involve  hundreds  or  thousands 
of  individual  points  at  which  \i  must  be  evaluated.  Each  of  these  points  therefore 
represents  a  substantial  cost  in  CPU  time. 

Since  both  nonsimilarity  and  similarity  scaling  methods  produce  identical  results 
for  element-by-element  structured  uncertainties,  it  would  seem  reasonable  to  con¬ 
clude  that  some  of  the  n2  —  1  degrees  of  freedom  required  for  similarity  scaling  are 
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actually  redundant.  Removing  these  unnecessary  optimization  variables,  if  possible, 
should  provide  for  some  corresponding  reduction  in  flops  required  to  compute  p  using 
similarity  scaling.  The  following  theorem  and  subsequent  proof  show  that,  in  fact, 
no  more  that  2 (n  —  1)  optimization  variables  are  required  to  compute  p  for  both 
similarity  and  nonsimilarity  scaling. 

Theorem  5.1:  Given  M  6  Cnxn  and  A  €  CnXn,  where  A  is  composed  of  1  x  1 
blocks,  the  solution  for  p(Ma)  in  the  optimization  problem 


p(Ma)  <  inf  a(DMaD~l) 


(5.1) 


can  be  found  with  no  more  than  2(n  —  1)  free  variables  as  long  as  a  stationary  point 
occurs  at  the  “inf.” 

Proof:  To  reduce  notational  complexity  this  proof  will  be  developed  explicitly  for 

the  2x2  case  and  the  general  n  x  n  case  will  follow  as  a  simple  extension.  Let 

62 

J3  84 

=  P, 


_ 

mu  mi2 

,  A  = 

i 

m2i  m22 

i 

'1*1 

1  w 

< 

Cl 

£ 

& 

.  \5s 

ij  m  _ 

_  P21  P22  J 

and  Pd  =  diag\pn:pl2,P2i,p22] ■  For  this  Pd  choose 


£1  = 


110  0 
0  0  11 


,  £2  = 


1  0 
0  1 
1  0 
0  1 


Applying  scaling  matrix  D  =  diag[di,  d2,  e/3,  d 4]  to  Ma  gives 


DMaD -1  = 


mnpn 

j^mi2P2i 

^rnuP22 

g m2\Pi\ 

m2\Pn 

^m22P2i 

fm22P22 

^™ll/>12 

m\2P2\ 

ftrnup22 

rri2xpn 

^m2ipi2 

jL3™22P2i 

m22P22 

(5.2) 
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Also,  from  the  singular  value  decomposition, 

DMaD~l  =  XZY11  (5.3) 

where  the  orthogonal  matrices  X  and  Y  represent  the  left  and  right  singular  vectors 
of  ( DMaD~l )  respectively  and  £  is  a  positive  diagonal  matrix  containing  the  singular 
values  of  ( DMaD~l )  in  decreasing  order  of  magnitude. 

For  the  2x2  case,  matrix  M  must  have  rank<  2.  Multiplying  M  on  the  left  by 
Ei  and  on  the  right  by  E\Pd  to  form  Ma  therefore  requires  that  rank(Ma)<  2  forcing 
cr3  =  <r4  =  0.  Expansion  of  Equation  5.3  into  its  individual  elements  gives 


xnyna\  +  *w?i2er2  '  ’  ’  xnV4iai  +  x\^y^i 
,  xnVn<7i  +  a?2i2/4icri  +  ®22j/42<r2  ,  x 

DMaD~l  =  .  (5.4) 

^lyn^i  +  xziy\YJ,i  •  •  •  xz\y4\a\  +  ^32' y 42°2 

x4iyn<J\  +  a;42yi2cr2  •  •  •  x4\y4\a\  +  *£422/42^2 
Examination  of  Equation  5.2  shows  that  the  ratio  of  the  first  and  third  rows 

is  a  scalar  constant  ai  =  d\/d 3.  Since  Equation  5.2  and  Equation  5.4  both  equal 

DMaD~l ,  the  ratio  of  the  first  and  third  rows  of  Equation  5.4  must  also  be  ot\. 

Equating  the  first  row  of  Equation  5.4  to  on  times  the  third  row  requires  that 


Vu 

2/21 

2/31 

V-ii 


_  ^2(^12  —  #1^32) 

9l2<ri{x  11  -  ai*3i) 
_  cr2(  X\2  —  &\XZ2) 
V22<rr(xn  ~  «i^3i) 

_  C2(^12  ~  Q  1^32) 

Vz2cri{xn  -  an  *31) 
-  al(xW  ~  aix32) 

V42cri(x n  -  ai»3i)’ 
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This  set  of  equations  implies  a  relationship  between  Y\  and  Y2  of  the  form  Y\  =  kY2 
where  k  =  |2l£i2^j£22i.  However  a  linear  dependence  between  Vi  and  Y2  is  impossible 
because  the  columns  and  rows  of  Y  are  orthogonal  to  one  another.  This  inconsistency 
is  satisfied  only  for  in  =  0:1X31  and  X12  =  01X32.  Relationships  between  the  remaining 
elements  of  X\  and  Y\  may  be  developed  through  a  similar  process.  The  second  and 
fourth  rows  of  5.4  have  a  ratio  o2  =  d2/d4  requiring  that  X2i  =  02X41  and  x22  =  a2x42. 
Finally,  the  ratio  of  the  first  and  second  columns  of  Equation  5.4  is  03  =  and 
the  ratio  of  the  third  and  fourth  columns  of  Equation  5.4  is  04  =  These 

relationships  between  the  columns  of  Equation  5.4  require  that  yu  =  a2y2i  and 
y3i  =  ct4y4\.  Summarizing  these  results  gives 


xu 

yn 

A'i  = 

x2i 

,  Vi  = 

yn/<*3 

xu/ari 

2/31 

x2i/a2 

ysi/a-i 

so  that  only  two  independent  terms  (xn,  X21  in  X\,  and  yu,  yz\  in  Ti)  appear  in 
each  vector.  From  Section  4.1,  equality  between  the  left-hand  and  right-hand  sides 
of  Equation  5.1  occurs  only  when  MPDA  conditions  are  established.  This  in  turn 
requires  pair-wise  equality  between  the  elements  of  Xi  and  Y\  or 

\xn |  =  |y,-i |  i  =  l,...,n2  (5.6) 

for  similarity  scaling  with  simple  a(DMaD~x).  Applying  the  requirements  of 
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Equation  5.6  to  Equation  5.5  gives 


l®n|  =  bill 


Nil 

kill 

ai 

kail 

a2 


=  bul 

=  bail 
bail 


knl 

«3 


knl  _  bul 

aiO;4  a2o;3 


From  this  series  of  relationships  the  following  equality  must  hold  under  MPDA  con¬ 
ditions: 

or2o:3 

Replacing  the  a,/a  in  Equation  5.7  with  the  corresponding  (/,•/,  and  p;j>,  gives 

dldjpnV2i  _  a 

d\pnVn  4 

which  shows  that  d. t  can  be  expressed  as  a  simple  function  of  d\,  d2,  d3  and  the  p;ji3. 
The  proof  for  the  2x2  case  is  completed  by  noting  that  one  of  the  d{  can  be  always  be 
scaled  to  “1”  without  loss  of  generality  so  that  the  total  number  of  free  optimization 
variables  is  2  =  2 (n  —  1). 

Having  completed  the  proof  for  the  2x2  case,  the  proof  for  the  general  n  x  n 
case  follows  in  the  same  manner.  For  M  6  CnXn  the  rank  of  expanded  matrix  Ma 
must  be  less  than  or  equal  to  n  requiring  that  an+ 1, . . . ,  ani  =0.  Continuing  with  the 
process  of  equating  rows  and  columns  of  Equations  5.2  and  5.4  produces  relationships 
between  A'i  and  Y\  similar  to  those  of  Equation  5.5  when  MPDA  is  established.  The 
number  of  independent  vector  elements  in  X\  and  Y\  becomes  n  for  the  general  case 
and  (n— l)2  expressions  in  the  form  of  Equation  5.7  describe  the  relationships  between 


(5-7) 
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the  2 n{n  —  1)  ay/,,.  From  these  expressions  a  general  equation  relating  the  dependent 
and  independent  values  of  optimal  scaling  matrix  D  can  be  written  as 

^2  _  ^i-n^n+\PuP2,(i-n)  g\ 

‘  rf|P(l,(i-«))P2,l 

i  =  n +  2,  ...,2n; 

d2  _  <ff-2A+lPllP3.(j-2n) 

;  dlP(l,(j-2n))P3,l 

j  =  2n  +  2, . . . ,  3 n; 

^fc-(n-l)n^fn-l)n+lPtlP(n,fc-(»-l)») 

^lP(l,k~(n—l)n)Pn,l 

(n  —  l)n  +  2,  •  •  •  ,n2. 

Subtracting  the  (n  —  l)2  expressions  in  the  form  of  Equation  5.7  from  the  n2  —  1 
variables  required  for  the  unreduced  scaling  structure  gives  the  desired  result  of 

n2  —  1  —  (n  —  l)2  =  2(n  -  1) 

optimization  variables.  ■ 

The  complexity  of  the  indices  in  Equation  5.8  unfortunately  obscures  the  simplicity 
of  its  application.  The  following  example  will  hopefully  clarify  these  results. 

Example  5.1:  Let  randomly  generated  matrices  M  €  C3x3  and  P  €  3£3x3  be  chosen 

as  in  Example  4.1  such  that 

'  .063  +  .156;  -.322  +  .480;  .585  +  .526;  ‘ 

M  =  .726  -.514;  -.323  -.344;  .150  -.469; 

_  .189  -  .463;  .053  -  .577;  -.236  -  .056;  _ 

and 

"  Pu  Pn  P13  1  *  2.99  3.03  0.54  " 

P  =  Pn  P22  P23  =  1-65  1.87  3.41  . 

_  P3i  P32  P33  J  1.90  1.20  1.37 


49 


Then  expanding  Ma  =  E2ME\Pi  gives  fi(Ma)  =  8.25  with  optimal  scaling  matrix 

'  d\  =  1 
d2  =  0.906 
dz  =  0.406 
d4  =  0.719 

d5  =  0.688  = 

D  = 

d6  =  0.988  = 
d7  =  0.783 
d&  =  0.560  = 

d9  =  0.636  = 

Of  course  this  is  identical  to  scaling  matrix  b  from  Example  4.1  found  by  optimizing 
over  n2  —  1  variables.  In  this  case,  the  added  complexity  of  computing  the  dependent 
elements  of  D  is  more  than  offset  by  the  savings  in  computations  resulting  from  the 
reduced  number  of  optimization  variables. 

Tables  5.1  through  5.3  summarize  the  savings  in  floating  point  operations  (flops) 
due  to  the  scaling  method  of  this  section.  The  “Size”  columns  represent  the  dimension 
of  expanded  matrix  Ma  while  the  number  of  optimization  variables  (n2-l,  or  2(n-l)) 
column  distinguishes  between  the  old  and  new  scaling  methods  respectively.  Both 
optimization  meinods  were  implemented  using  a  MATLAB™  based  quasi-Newton 
method  with  analytic  derivatives  [32,  33].  Also,  the  starting  point  D  =  /  was  chosen 
for  both  methods  to  make  the  comparison  as  equitable  as  possible.  This  initial  choice 
for  D  is  probably  as  good  as  any  other  for  randomly  generated  M  and  P  matrices. 
However,  stability  analysis  of  an  actual  system  using  a  frequency  sweep  could  use  the 
optimum  D  from  a  previous  frequency  point  as  an  improved  initial  point. 


For  the  4x4  systems,  the  average  flop  count  required  to  compute  //  using  n2  —  1 


System 

Size 

Optimization  Variables 

%  Difference 

bob 

2(n  —  1) 

1 

4 

85,847 

70,669 

-17.7 

2 

4 

84,463 

55,117 

-34.7 

3 

4 

102,588 

95,816 

-6.6 

4 

4 

83,236 

74,326 

-10.7 

5 

4 

83,239 

62,915 

-24.4 

6 

4 

67,001 

57,630 

-14.0 

7 

4 

75,247 

61,280 

-18.6 

8 

4 

62,237 

73,254 

+17.7 

9 

4 

78,078 

43,474 

-43.6 

10 

4 

61,244 

58,959 

-3.7 

11 

4 

77,018 

34,567 

-55.1 

12 

4 

51,155 

48,680 

-4.8 

13 

4 

88,406 

73,190 

-17.2 

14 

4 

53,459 

46,563 

-14.7 

15 

4 

57,321 

46,417 

-19.0 

16 

4 

59,942 

49,500 

-17.4 

17 

4 

47,572 

33,124 

-30.3 

18 

4 

56,281 

51,606 

-8.3 

19 

4 

76,508 

61,132 

-20.1 

20 

4 

89,680 

54,088 

-39.7 

21 

4 

63,493 

68,793 

+8.3 

22 

4 

75,972 

58,982 

-22.4 

23 

4 

74,279 

101,329 

+36.4 

24 

4 

77,029 

53,882 

-30.04 

25 

4 

76,402 

39,261 

-48.6 

Average: 

75,637 

57,955 

-23.4 

Table  5.2:  FLOP  Count  Comparison  for  9  x  9  Systems 


System 

Size 

Optimization  Variables 

%  Difference 

n2-  1 

2(n  —  1) 

1 

9 

1,178,572 

1,240,971 

5.3 

2 

9 

1,188,389 

600,863 

-49.4 

3 

9 

1,298,812 

991,019 

-23.7 

4 

9 

1,079,390 

810,888 

-24.9 

5 

9 

1,146,836 

785,305 

i 

CO 
►— * 

bx 

6 

9 

1,924,664 

1,315,678 

-31.6 

7 

9 

2,320,320 

1,172,074 

-49.5 

8 

9 

1,596,230 

1,332,371 

-16.5 

9 

9 

1,356,005 

1,182,523 

-12.8 

10 

9 

1,591,708 

1,294,051 

-18.7 

11 

9 

772  825 

606,518 

-21.5 

12 

9 

784,541 

922,383 

+17.5 

13 

9 

897,372 

798,991 

-11.0 

14 

9 

1,093,402 

1,090,522 

-5.8 

15 

9 

906,052 

501,610 

-44.6 

16 

9 

824,228 

847,159 

+2.8 

17 

9 

820,406 

731,396 

-10.8 

18 

9 

1,045,708 

865,101 

-17.3 

19 

9 

897,115 

605,850 

-32.5 

20 

9 

1,538,927 

982,515 

-36.2 

21 

9 

1,366,940 

938,130 

-31.5 

22 

9 

932,012 

722,522 

-22.5 

23 

9 

1,190,635 

581,242 

-51.2 

24 

9 

943,404 

990,477 

+4.9 

25 

9 

1,270,973 

831,299 

-17.5 

Average: 

1,167,769 

909,658 

-22.0 

Table  5.3:  FLOP  Count  Comparison  for  16  x  16  Systems 


Optimization  Variables 


System 

Sne 

n2  —  1 

2(n  —  1)  | 

%  Difference 

1 

16 

8,124,858 

5,633,607 

-30.7 

2 

16 

9,016,172 

4,483,259 

-50.3 

3 

16 

10,886,765 

9,511,034 

-12.6 

4 

16 

8,247,534 

6,271,272 

-24.0 

5 

16 

8,519,069 

5,544,709 

-34.9 

6 

16 

9,450,057 

5,919,971 

-37.4 

7 

16 

10,789,909 

9,097,814 

-15.7 

8 

16 

9,574,741 

8,900,847 

-7.0 

9 

16 

10,896,836 

8,523,835 

-21.8 

10 

16 

11,425,524 

7,454,489 

-34.8 

11 

16 

8,525  125 

6,322,816 

-25.8 

12 

16 

8,514,427 

5,807,702 

-31.8 

13 

16 

9,030,860 

8,373,217 

-7.3 

14 

16 

7,637,353 

5,854,87S 

-23.3 

15 

16 

10,589,478 

5,854,878 

-43.9 

16 

16 

10,629,956 

6,544,465 

-38.4 

17 

16 

9,661,623 

8,417,042 

-12.9 

18 

16 

9,175,808 

5,742,484 

-37.4 

19 

16 

7,354,233 

6,232,710 

-15.3 

20 

16 

7,611,841 

5,889,917 

-22.6 

21 

16 

10,710,469 

6,438,349 

-39.9 

22 

16 

11,214,534 

9,057,426 

-19.2 

23 

16 

7,954,635 

6,603,402 

-17.0 

24 

16 

9,350,555 

6,028,401 

-35.5 

25 

16 

8,541,532 

7,901,464 

-7.5 

Average: 

8,996,696 

6,896,279 

-23.3 
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variables  is  75,637  versus  57,955  using  2 (n  —  1)  variables:  a  savings  of  23.4%.  As 
system  size  increases,  the  flop  count  savings  do  not  change  appreciably  with  an  average 
savings  of  22.0%  for  the  9x9  systems  and  23.3%  for  the  16  x  16  systems.  Since 
/i  must  be  determined  over  a  wide  frequency  range  for  each  transfer  function,  the 
savings  offered  by  the  scaling  method  of  Theorem  5.1  promise  substantial  reductions 
in  computational  effort  for  the  design  of  robust,  multivariable  control  systems.  Note 
that  in  some  of  the  examples  shown  the  reduced  scaling  method  actually  requires 
more  flops  than  the  full  optimization  over  n2  —  1  variables.  This  occasional  anamoly 
is  probably  due  to  the  initial  guess  of  D  =  I.  For  the  unreduced  case,  all  diagonal 
elements  of  D  may  be  set  to  1,  while  the  reduced  structure  only  allows  the  free 
elements  to  be  set  to  1.  For  a  few  of  the  examples,  having  all  diagonal  elements  of 
D  =  1  apparently  provides  the  unreduced  method  with  a  superior  initial  guess  for 
the  optimization. 

Reducing  the  number  of  optimization  variables  offers  more  than  the  savings  in 
flop  counts.  Depending  on  the  minimization  algorithm  chosen,  a  possibly  substantial 
savings  in  memory  requirements  results  as  well.  For  example,  the  popular  quasi- 
Newton  (such  as  that  used  in  this  work)  routines  require  storage  on  the  order  of 
N2  where  N  represents  the  number  of  optimization  variables  [34].  Along  with  the 
memory  requirements  comes  additional  bookkeeping  to  keep  track  of  the  gradient 
information  for  the  N  variables.  Therefore  the  reduction  from  n2  —  1  to  2(n  —  1)  free 
variables  offers  computational  savings  that  may  not  show  up  directly  in  flop  count 


comparisons. 
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5.2  Effects  of  Zero  Elements  in  P 

Example  5.1  and  the  systems  of  Table  5.1  represent  the  maximum  reduction  of 
free  variables  from  n2  —  1  to  2(n  —  1).  If  elements  of  A  are  replaced  with  zeros, 
this  efficiency  gap  begins  to  diminish  although  the  new  method  is  always  at  least  as 
efficient  in  terms  of  optimization  variables  as  previous  similarity  scaling  methods.  In 
fact,  the  efficiency  advantage  holds  until  the  number  of  zeros  equals  n(n  —  2)  +  1  so, 
for  example,  a  10  x  10  system  with  up  to  80  zero  elements  in  A  will  still  maintain  an 
advantage  in  terms  of  free  variables.  Of  course  the  relationships  shown  in  Equation 
5.8  must  be  altered  to  account  for  the  reduction  in  the  c/,/,.  This  is  d  unonstrated  in 
Example  5.2. 

Example  5.2:  Starting  with  the  same  random  M  and  P  matrices  from  Example 
4.1,  set  p22  and  P33  to  zero.  Then  the  size  of  expanded  matrix  Ma  need  only  increase 
to  7  x  7  instead  of  9  x  9  since  the  zero  elements  of  P  can  be  eliminated.  The  optimal 
D  for  this  example  is  then  found  to  be 


with  p{Ma)  =  6.64.  Note  that  while  the  expression  for  d$  corresponds  to  that  of  de 
in  Example  5.1,  the  expression  for  ^7  changes  slightly  to  account  for  the  reduction  in 
the  number  of  d,/,.  This  example  shows  the  reduction  from  n2  —  1  =  8  to  n2  —  3  =  6 
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for  previous  methods  of  similarity  scaling.  The  new  method,  however,  experiences  no 
additional  reduction  and  remains  at  2 (n  —  1)  =  4. 

5.3  Application  to  Block  Structured  Uncertainties 

As  discussed  in  Section  2.4,  block  structured  uncertainties  represent  an  impor¬ 
tant  class  of  uncertainties  that,  in  general,  cannot  be  addressed  using  nonsimilarity 
scaling  techniques.  Therefore,  one  of  the  more  important  advantages  of  similarity 
scaling  is  the  ease  with  which  general  block  structured  uncertainties  may  be  handled. 
Furthermore,  by  examining  the  relationships  between  the  elements  of  A'i  and  Y\  as 
in  Equation  5.5,  it  is  also  possible  to  reduce  the  number  of  optimization  variables  for 
blocks  of  arbitrary  dimension.  Theorem  5.2  follows  directly  from  Theorem  5.1  and 
the  MPDA  relationships  for  block  structured  uncertainties  discussed  in  Section  4.2. 

Theorem  5.2:  Given  M  €  CnXn  and  A  €  Cnxn,  where  A  is  composed  of  blocks  of 
dimension  less  than  n,  the  solution  for  fi(Mb)  in  the  optimization  problem 

<  \nia{DbMaDbl)  (5.9) 

Db 

with  simple  a  can  be  found  with  no  more  than  2(n  —  1)  free  variables. 

Proof:  The  proof  is  straightforward  given  Theorem  5.1.  Recall  from  Section  3.4 
that  for  block  structured  uncertainties,  scaling  matrix  Db  contains  block-diagonal 
elements  corresponding  to  the  structure  of  P.  For  P  composed  solely  of  1  x  1  blocks, 
Theorem  5.1  reveals  that  no  more  than  2 (n  —  1)  optimization  variables  are  required 
to  compute  fi{Mb).  At  the  other  extreme,  for  P  composed  of  a  single,  diagonal,  block 
structured  uncertainty  of  size  n,  no  optimization  variables  would  be  required  since 
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the  system  effectively  simplifies  to  an  unstructured  uncertainty  problem.  For  systems 
falling  between  these  limiting  cases,  singular  vector  relationships  similar  to  those  of 
Equation  5.5  still  apply  so  the  number  of  optimization  variables  required  to  solve  any 
block  structured  problem  cannot  exceed  2 (n  -  1).  ■ 

Example  5.3  illustrates  the  application  of  the  optimization  variable  reduction 


method  to  block  structured  uncertainties. 

Example  5.3:  Using  the  same  M  and  A  matrices  from  Example  4.2,  form  expanded 
matrix  M(,  followed  by  DMbD~x.  The  rank  of  DMaD~l  must  be  less  than  or  equal 
to  3  so  comparing  the  terms  of  DMaD~l  in  the  form  of  Equation  5.2  with  that  in  the 


form  of  Equation  5.4  requires  the  structure  of  X\  and  Y\  to  be 


“ 

• 

xn 

yn 

X21 

£31 

,  n  = 

2/41 

2/51 

*21 1 

.  X3li . 

(5.10) 


As  in  Example  4.2 


the  optimizing  D  and  the 


absolute  values  of  the  elements  of 


and  Yi  are 


di  =  1 

'  0.427  ' 

‘  0.427  ' 

d2  =  0.82 

0.527 

0,527 

T— < 

O 

II 

^3 

0.187 

0.187 

dA  =  0.63 

,  1*1 1  = 

0.269 

,  \Yi\  = 

0.269 

d5  =  0.87 

0.373 

0.373 

d6  =  0.69 

0.442 

0.279 

d7  =  0.69 

0.313 

0.465 
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with  fi(Mb)  =  6.5.  Applying  the  MPDA  property  for  block  structures  along  with  the 
relationships  of  Equation  5.10  gives  a  simple  expression  for  de  (and  d 7)  in  terms  of 


the  other  d,-/a  and  the  elements  of  P  of  the  form 

r  1 2  1  [fe]2 
_  U1P2J - UimiJL  =  0.2267. 

I  ^ 


4  = 


d]Pi 2 1  4.  fiiml 


(5.11) 


Knowing  that  d e  is  dependent  on  the  elements  of  P  and  the  other  d;i3  allows  the  num¬ 
ber  of  optimization  variables  required  for  this  problem  to  be  reduced  from  5  to  only 
4.  Variable  reduction  for  other  block  structured  uncertainties  may  be  determined  in 
an  analogous  manner  and  will  depend  on  the  number  and  size  of  individual  blocks. 
Although  Equation  5.11  is  more  complicated  than  the  relationships  for  scalar  uncer¬ 
tainties,  the  elimination  of  unnecessary  free  variables  should  produce  a  net  savings  in 
floating  point  operations:  particularly  for  systems  of  high  order. 


5.4  Complete  Solution  to  the  Block  2x2  Problem 

In  the  original  paper  introducing  the  structured  singular  value  concept,  Doyle 
proved  that  for  k  <  3,  where  k  denotes  the  number  of  uncertainty  blocks  in  A,  the 
inequality  of 

n{Mb)  =  sup  p{MbUb)  <  inf  a{DbMbD;1)  (5.12) 

Ub  Db 

is  guaranteed  to  attain  equality  at  the  “inf”  regardless  of  whether  or  not  <7  is  sim¬ 
ple  [15].  For  the  special  2x2  scalar  case  where  A  has  the  form 

Si  62 
S3  84 
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Daniel,  Kouvaritakis  and  Latchman  have  shown  that  equality  of  Equation  5.12  may 
still  be  guaranteed  even  though  k  =  4  [31].  The  proof  for  this  result  depends  on 
formulating  the  problem  in  a  nonsimilarity  scaling  context  and  unfortunately  does 
not  extend  directly  to  handle  block  structured  systems.  However,  by  applying  the 
reduced  scaling  structure  of  the  previous  sections  to  the  original  proof  by  Doyle,  it 
is  shown  that  the  inequality  of  Equation  5.1.2  always  becomes  equality  for  any  block 
structured  uncertainty  of  the  form 

At  A2 

A  = 

A3  A4 

Doyle’s  original  proof  guaranteeing  equality  of  Equation  3.22  for  k  <  3  requires 
only  that  the  number  of  independent  optimization  variables  be  less  than  or  equal 
to  two  [15].  Since  one  of  the  elements  of  scaling  matrix  D  may  always  be  fixed  at 
some  arbitrary  value  without  loss  of  generality,  an  uncertainty  structure  with  three 
or  less  blocks  requires  no  more  than  two  independent  variables  for  the  optimization 
of  inf o  a(DMaD~l).  By  proving  that  a  2  x  2  block  structured  uncertainty  actually 
requires  no  more  than  two  optimization  variables,  this  result  may  be  extended  to 
guarantee  that  fi(Mb)  =  inf^  a{DbMbD^x)  for  general  2x2  block  structured  systems. 

The  following  discussion  briefly  outlines  the  work  of  Doyle  leading  up  to  his  lemma 
regarding  the  equality  of  Equation  5.12  for  two  or  less  optimization  variables.  This 
outline  is  presented  only  as  a  summary  and  the  reader  is  referred  to  [15]  for  a  complete 
treatment  of  this  material. 

Before  reviewing  Doyle’s  work,  a  number  of  terms  must  first  be  defined  start- 
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ing  with  the  notion  of  a  directional  derivative  for  <r{DbMbDbx)  with  respect  to  the 
elements  of  D.  Denote  the  multiplicity  of  7r  as  q  and  decompose  ( DbMbD j"1)  as 

(A  AW)  =  v(DbMbDZ1)XaYaH  +  XbI \YbH 

where  Xa ,  Ya  6  CnXq  for  Mb  €  Cnxn.  Next,  define  some  Hj  such  that 

Hi  =  iff  =  Re(A :”(DMD;‘)Y.). 

A  directional  derivative,  V2,  may  then  be  written  as 

V2  =  {x  €  9?r,  Xj  =  v11  HjV  1 1;  6  C,  =  1} 

where  r  represents  the  number  of  elements  in  the  directional  derivative  and  similarly 
the  number  of  optimization  variables.  Formulated  in  this  manner,  V2  simply  reduces 
to  an  ordinary  gradient  for  q  —  1.  Also,  for  V2  =  0  regardless  of  the  multiplicity  of 
a,  a  true  stationary  point  is  achieved.  From  the  MPDA  theory  this  stationary  point 
assures  that  fi(Mb)  —  inf# a(DbMbDbl). 

The  convex  hull  of  V2,  denoted  coV2,  also  provides  gradient-type  information  and 
for  <?=1,  coV2  likewise  reduces  to  an  ordinary  gradient.  However,  for  <7  >  1,  a  value 
of  coV2  =  0  indicates  only  that  the  “inf”  of  miDb&(DbMbDbl)  has  been  achieved 
with  no  guarantee  of  a  corresponding  stationary  point.  Such  a  situation  may  actually 
be  a  “cusp”  where  n(Mb)  <  inf ob{DbMbDbx). 

Now  let  Hj  =  Hf  €  CqXg,  j  =  1, . . . ,  r  and  define  some  /  :  Cq  — >  9£r  as 

fj{x)  =  xllHjX  for  x  6  Cq . 

Also,  for  some  positive  integer  m,  let 

Pm  =  {x  e  Cm  |  xHx  =  1}  C  Cm 
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and 

Sm  =  {x  e  &m+1  |  xTx  =  1}  C  £m+1. 

Using  these  definitions,  the  following  lemma  shows  that  in  certain  cases,  V2  =coV2. 
In  such  cases,  the  condition  coV2  =  0  implies  that  V2  =  0  and  therefore  signifies  that 
a  stationary  point  has  been  found. 

Lemma  5.1:  For  /  :  Cq  — >  3£r  defined  as  above,  if  r  =  1  or  2,  then  f(Pq )  is  convex. 

Proof:  See  [15]. 

Lemma  5.1  implies  that  regardless  of  the  multiplicity  of  <7,  if  no  more  that  2 
optimization  variables  are  required  to  solve  Equation  5.12,  then  V2  =coV2  and  the 
minimum  of  the  infimization  must  be  a  stationary  point  and  hence  equals  fi(Mb)- 
The  following  theorem  reveals  that  no  more  than  2  free  optimization  variables  are 
required  to  solve  Equation  5.12  for  systems  with  2x2  block  structured  uncertainties 
allowing  the  results  of  Lemma  5.1  to  be  invoked. 

Theorem  5.3:  For  the  case  of  2  x  2  block  structured  uncertainties,  no  more  than 
two  optimization  variables  are  required  to  solve  infpfc  regardless  of  the 

individual  block  sizes. 

Proof:  Define  M  €  Cnxn  and  2x2  uncertainty  matrix  A  with  corresponding  P 
matrix  as 


1 

■ 

“ 

A, 

a2 

->  P  = 

Pl'Im 

Pl'Im 

A3 

A4 

P3  ■  Im 

P4  ■  im 

with  P  €  Si*1*",  Pi  €  3£mxm  and  m  =  n/2.  Then  similarity  scaling  matrix  D  has  the 
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structure 


D  —  dio>Q\d\ )  *  *  t )  d\  ,  d2  ,  •  •  • ,  d2^  d3,  •  *  •  >  ^3>  ^4,  •  •  •  5  ^4]  • 


Expanding  the  system  to  the  form  of  Equations  5.2  and  5.4  and  comparing  terms 
requires  the  elements  of  X\  and  Y\  to  be 


•Em,  1 
*E(m+l),l 


Vm,l 


•E2m,l 

Al  =  ,4l 

XUdi 


ym>la(Pl)d2 

2/(2m+l),l 


(5.13) 


O’  1 

d, 

X(m+l)il^2 


2/(3m),l 

2/(2m+l),l=n^tj7 


X2m,l^ 


«>/ 

2/(3m),l  or(/>3)d4 


Applying  the  MPDA  requirement  for  block  structured  uncertainties  to  Equation  5.13 
gives  a  relationship  between  the  elements  of  D  as 

d  =  yjPiPW 
4  di  \a{P2)a(P3) 

Fixing  d\  =  1  leaves  only  d2  and  d3  as  independent  variables  completing  the  proof.  ■ 
Example  5.4:  As  a  simple  example,  consider  the  following  4x4  problem  with  four 

2x2  block  structured  uncertainties. 

'  0.8  +  j0.3  1.0  +  ;0.9  0.1+;0.5  0.4  +  ;0.5‘ 

1.0  +  jOA  0.7  +  ;0.9  0.6  +  ;0.5  0.8  +  ;0.3 

M  ~  0.4  +  ;0.2  0.8  +  ;0.1  0.9  +  .;0.3  0.5  +  ;0.1 

.  0.2+;0.5  0.7  +  j0.9  0.3-f;1.0  0.2  +  ;0.9. 


A  =  Al 

a3  a4 


10  2  0 
0  10  2 
3  0  4  0 
0  3  0  4 


with 
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The  singular  vector  relationships  of  Equation  5.13  become 


For  this  particular  example,  the  optimal  D  is 

d\  —  1 
d\  —  1 
d2  =  1.38 
d2  =  1.38 
D  -  d3  =  1.65 
d3  =  1.65 

d.,  =  1  86  =  dzdi .  klK4) 

a4  -  i.oo  -  rii  y  (2)(3) 

d4  =  1  86  =  ^  i 
cm  i.ou  dt  y  (2)(3) 

with  fi(Mb)  —  16.43. 


The  previous  sections  show  that  when  n  is  found,  the  corresponding  optimal 
scaling  matrix  D  exhibits  a  degree  of  redundancy  which  allows  a  reduction  in  the 
number  of  independent  optimization  variables.  It  is,  however,  important  to  ensure 
that  when  the  relationships  between  the  d{>3  and  are  incorporated  in  a  reduced 
scaling  structure  from  the  outset,  the  convexity  properties  of  the  original  problem 
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are  not  lost.  The  remainder  of  this  section  will  show  that,  in  fact,  these  important 
convexity  properties  do  still  apply.  Furthermore,  MPDA  conditions  used  to  establish 
equality  of  Equation  3.22  for  the  full  scaling  structure  also  pertain  to  the  reduced 
scaling  structure  guaranteeing  its  convergence  to  /z. 

Convexity  of  a2(DMaD~l)  in  terms  of  eD,  D  was  first  proven  by  Safonov  and 
Doyle  [28].  Subsequent  authors  have  offered  alternative  proofs  in  somewhat  simpler 
forms  [35,  29].  The  following  lemma  concerning  the  convexity  of  some  function  / 
was  proven  by  Tsing  [36]. 

Lemma  5.2:  Suppose  Vrr,  gx :  1ft  1ft  twice  differentiable  such  that  /(: x)  =  gx{x), 
f(t)  >  gx(t )  and  ■~^gx{t)\t=x  >  0.  Then  /  is  convex. 

Proof:  See  [36]. 

Lemma  5.2  may  then  be  applied  to  the  scaled  singular  value  problem  to  prove 
convexity  of  a(eDMae~D).  For  notational  brevity,  denote  Mx  =  eDxMae~Dx  with 
singular  vectors  u  and  v  such  that  f(x)  =  a(Mx)  =  uHMxv.  Next,  define  gx(t )  = 
$t{uHeDlMae~Dlv}.  Since  f(t)  >  gx(t)  and 

jf29x(t)\t=x  =  ${uh(D2Mx  -  2DMXD  +  MxD2)v} 

=  f(x)(uHD2u  +  vhD2v )  -  2${uI{DMxDv) 


uH D  vnD) 

J(x)I  -M„ 

Du 

-M«  f(x)I 

Dv 

Therefore,  by  Lemma  5.2  /  is  convex. 

This  convexity  property  guarantees  that  any  local  minimum  of  a(DMaD~l)  with 
respect  to  D  is  always  a  global  minimum.  For  a  new  scaling  matrix,  S,  that  inherently 
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incorporates  the  reduced  scaling  structure  of  Equation  5.8,  the  convexity  lemma  still 
applies  since  S  is  a  subset  of  V.  Therefore,  it  must  be  shown  that  the  value  of 
a(SMaS~x)  at  the  minimum  always  corresponds  to  fi  and  not  some  other  value. 

Theorem  5.4:  For  diagonal  scaling  matrices  D,S  €  3J"2xn2  where  S  conforms  to 
the  scaling  structure  of  Equation  5.8,  the  following  equality  must  hold 

mia(DMaD~x)  =  inf  a(SMaS~x) 

whenever  a  stationary  point  occurs  at  the  “inf”  of  either  side  of  the  equality. 

Proof:  This  proof  requires  examination  of  the  stationary  points  at  which 
— =  0.  Any  maxima  of  a(SMaS~x)  with  respect  to  S  occurs  at  infinity 
because  increases  without  bound  with  the  free  elements  of  S.  This  eliminates 
the  possibility  of  any  stationary  point  corresponding  to  a  maxima.  Inflection  points 
are  not  possible  due  to  the  convexity  properties  of  es  so  any  stationary  point  must 
coincide  with  a  minima.  Again,  the  convexity  property  requires  any  local  minimum 
to  be  a  global  minimum  so  if  a  stationary  point  can  be  found,  it  must  correspond  to 
the  unique  minimum. 

Because  the  maximum  singular  value  is  always  a  positive  function  of  some  complex 
matrix  A  €  CnXm,  the  stationary  points  of  a2(A)  correspond  to  those  of  a{A).  This  al¬ 
lows  an  alternative  form  of  the  singular  values  to  be  written  in  terms  of  the  eigenvalues 
giving  a2(A)  =  Ai(A;/A)  where  Ai  represents  the  eigenvalue  of  maximum  magnitude. 
Denoting  the  eigenvector  corresponding  to  Ai  as  W\,  the  eigenvalue/eigenvector  rela¬ 
tionship  may  be  written  as 


AHAWX  =  A  XWX 


(5.14) 
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and  the  derivative  with  respect  to  some  variable  x  as 

BAmmXim+wn 

ox  ox  ox  ox 

Multiplying  on  the  left  by  Wf1  gives 

w?*g*wt + =  w?  xM + 

OX  OX  OX  UX 


which  can  then  be  simplified  to 


OS  ox 


(5.15) 


giving  an  expression  for  the  derivative  of  Aj  with  respect  to  x.  The  derivative  with 
respect  to  some  element  of  scaling  matrix  S  can  then  be  found  by  replacing  the 
variables  of  Equation  5.14  such  that 

A  =  AH  A  —  S~1M^D2MaS~1 


Wi  =  Yx 


Aj  =  a2. 


The  differentiation  will  first  be  carried  out  for  a  2  x  2  case  with  respect  to  element 


S22  of  S  where 


S  —  diag[sn,  S22>  533,  S22S331 

V  P12P21 


First  decompose  S  such  that 


S  —  So  +  522-^22  +  522533  k 


where  ,%  =  diag[su,  0,  S33,  0],  k  =  and  the  E,t  are  n  x  n  matrices  with  a 

1  at  the  nth  position  and  0  elsewhere.  Likewise,  S~l  may  be  decomposed  as 

0-1  _  c-i  ,  £22  ,  Eu 

O  —  Oq  i  I  r. 

522  S22S33K 
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Differentiation  of  A  with  respect  to  S22  then  proceeds  directly  with 


-^M?S2MaS~x  -  M?S2MaS~x 

522  5^2S33ft 

522  522533K_ 

+25225-1Mf  +E445l3PMa5'-1 


The  Exi  terms  contain  only  one  nonzero  value  so  this  expression  may  be  rewritten  as 


~E22S-xM^S2MaS-x  -  EA4S~l S2MaS~x 

522  522S33« 

-  S-£s-xM?S2MaS-xE22  -  S-xM?S2MaS~'E4 

522  522S33K 

+  2snS~l  EnMaS-x  +  2s22s2zk2S-x  E^MaS~x . 


Multiplying  on  the  left  by  Y^  and  the  right  by  Yi  as  in  Equation  5.15,  gives  an 
expression  for  of 

-  -2pWl  +  2W,] 

C/S22  C/S22  522  522 

+2522^  [S"1  +  s23k2S-xM”EuMaS-x}  Yi. 


Noting  that 


—  -  2j  — 

(9522  ^522 5 


YlHS-lM?SEii  =  aX”EiU 


a  final  expression  for  -if-  may  be  written  as 


=  —  [*?(£«  +  iW  -  K"(£22  +  Eaa)Yx\ 

OS22  522  1  J 


(5.16) 
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Carrying  out  this  process  with  respect  to  S33  gives  an  expression  similar  to  Equation 
5.16  of 

=  —  [*?(£33  +  E„)X,  -  Y»(E3Z  +  E«)YX\  ■  (5.17) 

OS33  S33  1  1 

Setting  Equations  5.16  and  5.17  to  zero  establishes  conditions  at  which  the  new  scaling 
technique  reaches  its  global  minimum.  The  stationary  point  therefore  corresponds  to 
the  value  of  S  where 


[x2ix2\  -  y2\V 21  +  Z41Z41  -  y41?/4i)  =  0  (5.18) 

and 

[^31®31  -  2/312/31  +  £41*41  ~  £41^1]  =  0  (5.19) 

simultaneously. 

The  MPDA  property  guarantees  that  when  element-by-element  alignment  of  the 
principal  singular  vectors  of  a(5Ma5_1)  occurs,  some  Ud  exists  such  that  a(SMaS~l)  = 
p{MaUd)  =  From  Equations  5.18  and  5.19,  such  an  alignment  of  X\  and  Y\ 

always  coincides  with  a  stationary  point  of  — 5|^5-  ^  because  it  requires  that 

1 3-21 P  ~  |2/2l|2  =  |*3lj2  -  1^31 12  =  1*41  P  “  |2/4l|2  =  0. 

Therefore  one,  and  hence  the  only,  minima  of  a(5M05-1)  with  respect  to  S  must 
correspond  to  fi(Ma )  as  long  as  some  stationary  point  does  exist. 

Extension  to  higher  order  systems  is  simply  a  matter  of  determining  which  ele¬ 
ments  of  S  depend  on  a  particular  s,-.  For  each  occurrence,  an  additional 


XfEjjXi  -  YfEjjYt 
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expression  is  included  where  the  jj  terms  denote  the  elements  of  S  dependent  on  a 
particular  s,-  term.  Again,  general  n  x  n  systems  similarly  invoke  MPDA  conditions 
guaranteeing  that  the  stationary  point  corresponds  to  n(Ma).  ■ 


CHAPTER  6 

DIRECT  RELATIONSHIP  BETWEEN  R,  L%  AND  D 
6.1  Problem  Formulation 

The  results  of  Chapter  5  show  the  number  of  optimization  variables  required  for 
similarity  scaling  to  be  no  more  than  that  required  for  nonsimilarity  scaling.  It  would 
therefore  seem  consistent  that  some  direct  relationship  exists  between  scaling  matrices 
D  for  infogp  ^(DMqD-1)  and  R  and  L  for  inf^o^R-1  M L~x)a(LP R)  since  both 
methods  accurately  compute  /z.  The  following  Theorem  presents  such  a  relationship. 

Theorem  6.1:  For  element-by-element  structured  uncertainties,  given  some  scaling 
matrix  D  conforming  to  the  structure  of  Equation  5.8,  a  corresponding  R  and  L  of 


the  form 


r2  =  +  4+1  +  d%n+l  •  •  •  <ff(n-l)-U  ■  _  o 

'  +  <£+,•  + <&,+,•  •  ••<(„-!)+; 


p?  i  i  Pi2  . . .  gLa 
df  +  df  dj 


rl2  T  J2  772 

a, »(»-!)+!  an($-l)+2  an(i-l)+n 


i  =  2, . . . ,  n 


may  be  found  without  further  optimization  such  that 


a{{R-'ML-x)a(LPR )  =  ai(DMaD~l)  i  =  1, . . .  ,n. 


Proof:  Rather  than  dealing  explicitly  with  singular  values,  it  is  more  convenient 
in  this  case  to  use  the  equivalent  representation 


<ri(A)  =  \AM 


Aecnxn,  i  =  1, . . . ,n 
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By  similarly  expanding  /?,  B  and  C  are  found  to  be 


K  •  ^M\\M\\M22^22  4*  M21M21M  12-^12] 

—  K  •  \\M\2M 22M21  +  M \\M\2M 2\M2^ 

[j2  ,  j2l  r  j2  ,  J2l  P11P21  ,  P11P22  ,  P12P21  ,  P12P22 
[d,  +  rf3|  •  \i,  +  d,\  ■  _  +  _  +  —  +  _ 


Equating  ba2(LPR)  =  B  and  c/74(LPR)  =  C ,  the  following  direct  relationships 


between  R ,  T,  and  .D  will  hold 


,2  _  \dl  +  4]  ,2  =  Ji 

2  +  2  4 


p?i  1  £12 

a  ^  a 


£21  J. 


but  only  when  equality  constraint 


*(iWJ) -[*  +  «]•[$  +  | 


applies.  At  first  glance  it  may  appear  that  the  constraint  of  Equation  6.5  severely 
limits  conditions  under  which  Equation  6.4  holds.  The  remainder  of  the  proof  will 
reveal,  however,  that  this  constraint  simply  requires  the  reduced  similarity  scaling 
structure  developed  in  Chapter  5. 

The  left  hand  side  of  Equation  6.5  may  always  be  represented  as  \\  [(LPR)n(LPR)]. 
Expanding  matrix  (LPR)H (LPR)  gives 

„  Pll+P21^2  PnPl2V2  +  P2lP22llr2 

0 LPRfiLPR )  = 

P\lP\2r2  +  P2lP22^2r2  Pl2r2  +  P22rVl 
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The  corresponding  characteristic  equation  of  this  matrix  becomes 

A2  +  b\  +  c. 

A 

Denote  the  two  roots  of  Equation  6.7  as  a2(LPR)  and  a2(LPR).  Coefficients  b  and 


c  may  be  expressed  in  terms  of  these  roots  such  that 

b  =  a\LPR)  +  al(LPR)  (6.6) 

c  =  a2(LPR)  •  al(LPR).  (6.7) 

Relating  these  expressions  to  the  elements  of  ( LPR )  gives 

b  =  a2(LPR)  +  a\{LPR)  =  p2n  +  p22ll\  -f  p\2r22  +  p222r\l\  (6.8) 

and 

c  =  a2(LPR)  •  a2{LPR)  (6.9) 

=  (Pll  +  VIA)  •  (Pl 2r2  +  P22rlll)  (6‘10) 

~  (PllPl2r2  +  P21P22&2)  •  (pilPl2r2  +  p2ip22/2r2).  (6.11) 


Replacing  r2  and  /2  in  Equations  6.8  and  6.11  with  the  relationships  of  Equation  6.4 
and  simplifying  gives 
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Equation  6.7  shows  coefficient  c  to  be  the  product  of  a2(LPR)  and  a\{LPR).  Com¬ 


bining  Equations  6.5  and  6.12  then  gives  the  expression  for  a2{LPR )  of 

2  \PnP22  —  P12P21]2 

a2~[di  +  di)  [4  +  4]* 

3  4 


(G.13) 


Equation  6.7  also  shows  that  coefficient  b  is  the  sum  of  a2{LPR)  and  a2{LPR) 
so  that  b  may  likewise  be  written  as 


Uk+4  §+i|]+j£n£*-W 

1  1  K  4 J  [dS  +  df].  4  +  2 


This  gives  a  new  expression  for  b  that  must  be  satisfied  along  with  Equation  6.8  so 


P11  1  p?2 


2,2  r^+^i ,  2  .  2 

b  —  pn+pl2  ,2  ,  ,2  +P21  p2  IT"  +P22  .2  ,  J2  ■  P  ^7 

[«2  +  “3J  L“2  +  a^J  2^  +  £gt 

.  “3  “u  L  “3  “4 


=  te  + 41-  [§  +  §]+  [Pn?22->W 
11  31  K  <H\  [<2  + <*?]•[$  +  $  ] 


Multiplying  out  these  two  expressions  and  eliminating  common  terms  reveals  that 
they  are  equal  whenever 


<MPi2P2i  +  iVlr]\Ph  -  ’itt\iS/i/\pnVnP2iVri  =  0. 


(6.14) 


Equation  6.14  can  be  rewritten  as 


\d\dlpnP21  -  d\dlpnP22^  =  0 


so  the  constraint  of  Equation  6.5  will  hold  for  D  and  P  such  that 


d<i  =  d2 


fPnPn 

P12P21 
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This,  however,  is  the  same  dependent  relationship  between  the  py/s  and  d{>3  of  Equa¬ 
tion  5.8  so  the  direct  relationship  between  R  and  L  of  nonsimilarity  scaling  and  D  of 
similarity  scaling  will  always  exist  whenever  the  new  scaling  structure  is  invoked. 

For  the  n  x  n  case,  a  general  expression  for  R  and  L  as  a  function  of  D  may 
be  derived  in  an  analogous  manner  by  comparing  the  characteristic  equations  of  the 
corresponding  matrix  systems.  For  M  G  C3x3,  the  characteristic  equations  would 
contain  three  coefficients  which  must  then  be  matched  up  as  before.  While  intuitively 
simple,  tb'*  -ome  too  unwieldy  to  present  here.  However, 

r  r  ....  ^ation  6.4  may  be  written  as 


and 


with 


ax(R-lML'l)a{LPR)  =  <Xi(DMaD~x)  =  n(Ma)  =  8.25 
a2{R~x  M  L~x)a(LP  R)  =  a2{DMaD-x)  =  5.72 
a3(R-xML~x)a(LPR)  =  a3(DMaD~l)  =  1.76. 
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A  A  A 

It  should  be  noted  that  this  transformation  not  only  provides  L  and  72  given  D , 
but  also  allows  the  computation  of  £>  given  R  and  L.  However,  this  reverse  trans¬ 
formation  results  in  a  transcendental  function  that  must  be  solved  iteratively.  The 
iterative  solution  is  still  quite  fast  since  it  involves  only  elementary  math  operations: 
no  singular  value  or  eigenvalue  decompositions. 


6.2  Independence  from  M 

An  interesting  observation  concerning  Equations  6.1  and  6.2  is  that  they  are  not 
functions  of  M.  This  independence  from  M  requires  that  any  P,  R,  L ,  D  combination 
satisfying  Equation  6.1  for  a  given  M,  must  also  satisfy  Equation  6.3  for  any  other 
M.  As  a  simple  example  conshi  ?r  the  lollowing  system  with  two  randomly  generated 
M  matrices. 

0.87  —  *2.01  4.70  —  z'0.08  4  3 

Mi  —  M2  = 

4.16  -  i0.59  -0.92 +  *'0.13  2  1 

1.59  1.64 

P=  D  =  diag[l,  1.415,  1.401,  1.057) 

2.81  0.83 

where  D  satisfies  the  new  scaling  structure.  Values  for  R  .nd  L  are  found  to  be 


R  =  diag[ 1,  0.974]  L  =  diag[  1,  0.912] 
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so  that  Equation  6.3  becomes 

ai(R~l  MiL~l)a(LPR)  =  ax{DMalD^)  =  20.28 
a2{R-lMiIrl)a{LPR)  =  a2(DMalD~1)  =  13M. 

Using  the  same  R,  L,  and  D  values  for  M2  with  no  optimization  gives 

<rx{R-lM2L-l)a{LPR)  =  ax{DMa2D~l)  =  19.18 
cr2(R~l  MiL~l)a(LPR)  =  a2{DMa2D~l)  =  1.34. 

The  direct  relationship  between  R,  L  and  D  leads  to  the  following  corollary  con¬ 
cerning  systems  that  “cusp”  during  the  optimization  process. 

Corollary  6.1:  Systems  with  element- by-element  structured  uncertainties  with  non¬ 
simple  a(R"1  M L"1)  (“cusps”)  for  nonsimilarity  scaling  must  also  produce  cusps  for 
similarity  scaling  techniques  and  vice-versa. 

Proof:  The  proof  follows  directly  from  the  one-to-one  correspondence  between  the 
(Tiis  of  (Xi(R~l M L~l)a(LP R)  and  ai(DMaD~1)  required  by  Equation  6.3.  ■ 

Corollary  6.1  serves  to  eliminate  the  appealing  notion  that  a  one  of  these  two 
scaling  techniques  may  converge  to  (i  even  if  the  other  method  fails.  While  some 
alternative  scaling  technique  may  eventually  overcome  the  difficulties  associated  with 
cusping  systems  it  is  the  case  for  nonsimilarity  or  similarity  scaling. 

Besides  ruling  out  the  use  of  an  alternative  scaling  method  for  cusping  systems, 
the  direct  relationship  between  R ,  L,  and  D  serves  to  improve  the  vector  optimization 
method  of  Fan  and  Tits.  The  next  chapter  includes  a  discussion  of  this  method  and 
the  improvements  offered  by  the  R ,  L,  D  relationship. 


CHAPTER  7 

REDUCTION  OF  OPTIMIZATION  VARIABLES  REQUIRED  FOR  THE 

METHOD  OF  FAN  AND  TITS 

7.1  Constrained  Vector  Optimization 

While  the  scaling  structure  developed  in  Chapter  5  allows  a  reduction  in  the  num¬ 
ber  of  optimization  variables  required  to  compute  /z,  the  process  still  involves  a  large 
number  of  singular  value  decompositions.  To  avoid  these  computationally  intensive 
decompositions,  Fan  and  Tits  introduced  a  constrained  vector  optimization  proce¬ 
dure  that  calculates  /z  using  a  series  of  vector  norms  [37],  The  problem  formulation 
is  similar  to  the  eigenvalue/eigenvector  relationship  for  matrix  A  6  CnXn 

AWi  =  XiWi 

where  represents  an  eigenvector  and  A,  the  corresponding  eigenvalue.  Since  /z(M0) 
is  actually  the  solution  to  sup u  p(MaU),  there  exists  some  supremizing  U  such  that 

\\MaUWx\\2  =  ||A  ,{MaU)\Vx\\2  =  p(MaU) \\Wih  =  n{Ma). 

For  HVF1II2  =  1,  this  reduces  to  \\MaUW\\\2  =  p(MaU )  =  p(Ma).  Defining  vector 
x  =  UW\  then  gives 

||  Maxh  =  u(Ma).  (7.1) 

Computing  the  2-norm  of  a  complex  16  x  16  vector  requires  about  4n  or  64  flops  versus 
more  than  50,000  flops  for  the  singular  value  decomposition  of  a  complex  16  x  16 
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matrix.  Therefore,  optimizing  the  left-hand  side  of  Equation  7.1  offers  potentially 
significant  numerical  advantages  in  the  calculation  of  fi. 

The  optimization  must  first  be  constrained  to  include  only  “x”  vectors  of  the  form 
UW;.  This  constraint  is  imposed  by  the  Fan  and  Tits  formulation  for  f.i(Ma ): 

n(Ma)  =  max  {||Max||2 1  ||'P,x||2||Max||2  =  ||7Wax||2,  i  =  l,...,m}  (7.2) 

zee**2 

where  the  Vi>a  are  projection  matrices  with  block  size  m  and  x  €  C"2.  Unfortunately, 
this  function  is  nonconvex  so  it  offers  only  candidate  solutions.  Determining  whether 
these  candidates  equal  n(Ma)  involves  computing  a  corresponding  scaling  matrix  D 
from  x  and  ensuring  that  ||Max||2  =  a(DMaD~l).  Also,  in  its  original  formulation, 
this  expression  requires  n2  —  1  optimization  variables.  The  following  Theorem  shows 
that  the  scaling  structure  introduced  in  Chapter  5  applies  to  Equation  7.2  so  the  Fan 
and  Tits  method  actually  requires  no  more  than  2(n  —  1)  optimization  variables. 

Theorem  7.1:  Given  M  €  CnXn  and  A  6  Cnx”,  where  A  is  composed  of  1  x  1 
blocks,  the  solution  for  n(Ma)  in  the  constrained  optimization  problem 

n{Ma)  =  max  (||Max||2 1  ||7?{xj|2||Afax||2  =  \\ViMax\\2)  i  =  1,. . .  ,m) 
xecn2 

with  simple  <7  can  be  found  with  no  more  than  2(n  —  ) )  free  variables. 

Proof:  As  in  Section  5.5,  denote  the  reduced  scaling  structure  of  Equation  5.8  as 
S.  Let  S  be  the  solution  to  inf sa(5Ma5-1)  and  Y\  the  corresponding  right  singular 
vector.  Then  the  optimal  solution,  x,  to  Equation  7.2  may  be  written  as  [38] 

S~xYx 

WS-'Yxh 


x  = 
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The  dependent  relationships  between  the  elements  of  Yx  may  be  determined  using 
the  method  of  page  46.  Since  Fj  is  a  singular  vector  of  SMaS~\  any  scalar  multiple 
of  Yx  will  also  be  a  singular  vector  of  SMaS~K  Therefore  it  can  be  scaled  such  that 
]/n  =  1  without  loss  of  generality.  Likewise,  sx  may  also  be  scaled  to  1.  Carrying  out 
the  multiplication  of  S  and  Yu  results  in  a  general  expression  of  the  form 

1 

2/2,1 

Vn,l 
2/n+l,l 
2/2,l2/n+l,l 

2/n,l2/n+l,l 
Z/2n+l,l 
2/2,l2/2n+l,l 

J/n,l2/(n-l)n+l,l 

Subtracting  the  (n  -  l)2  fixed  variables  of  Equation  7.3  from  the  n2  -  1  free 
variables  previously  required  leaves  the  desired  result  of  n2  -  1  -  (n  -  l)2  =  2(n  -  1) 
free  optimization  variables.  ■ 

Example  7.1  illustrates  the  application  of  Equation  7.3  to  the  system  of  Example 
4.1. 


Example  7.1:  Using  the  system  of  Example  4.1,  the  optimal  ■S'-1}'!  may  be  written 


7/n4^ 

1,11  ajjPll 


S'lYi  = 


2/41  life 


Scaling  7/11  and  Si  to  1  gives 


S~XYX 


with  corresponding 


■  a^pnSi 


2/71  ife 

2/71T£21-- 

sipnsi 


1  I  1 

7/21  1.236 

7/31  1.094 

7/41  0.855  +  0.519; 

2/2i2/4i  =  1.056  +  0.641; 

2/3i2/4i  0.935  +  0.567; 

7/ri  0.355  +  0.935; 

2/2i2/7i  0.438+  1.155; 

2/3i2/7i  .  .  0.388  +  1.023; 


\\Max\\2  =  8.25  =  a(DMaD-x). 


7.2  Application  of  the  Relationship  Between  R.  L  and  S 


As  mentioned  earlier,  the  solution  to  Equation  7.2  is  nonconvex  with  many  local 


maxima  possible.  The  only  means  of  ensuring  that  a  candidate  solution  vector,  x, 
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A 

actually  corresponds  to  fi(Ma)  is  to  determine  optimal  scaling  matrix  D  such  that 

IIAMh  =  aiDMaD-1)  =  fi(Ma).  (7.4) 

(The  method  for  directly  computing  D  given  x  is  presented  in  Section  8.6.)  Unfortu¬ 
nately,  this  singular  value  decomposition  must  be  performed  on  the  expanded  n2  x  n2 
system  for  each  candidate  solution,  reducing  the  overall  efficiency  of  this  technique. 
However,  using  the  direct  transformation  from  D  to  R  and  L  presented  in  Chapter 
6,  the  verification  test  for  may  be  reformulated  as 

||Maz||2  =  a(R~l  M  L~l)a(LP  R)  (7.5) 

where  the  left-hand  side  consists  of  only  n  x  n  matrices.  For  a  system  with  n  =  3, 
the  expanded  n2  x  n2  matrices  of  Equation  7.4  require  about  10, 000  flops  for  a  single 
singular  value  decomposition.  Equation  7.5,  on  the  other  hand,  requires  about  1,000 
flops  including  both  singular  value  decompositions  and  the  required  transformation 
from  D  to  L  and  R.  Even  more  substantial  savings  occur  as  n  increases. 

The  Fan  and  Tits  method  outlined  in  this  chapter  first  appeared  in  1985  [37]. 
Their  constrained  optimization  procedure  was  implemented  using  specialized  soft¬ 
ware  from  the  proprietary  Harwell  Subroutine  Library  [39].  Since  that  time,  the 
growing  acceptance  of  the  structured  singular  value  concept  has  led  to  a  number  of 
alternative  formulations  of  fi.  These  formulations  are  aimed  at  improving  its  com¬ 
putational  properties  to  allow  for  interactive  design  without  the  restrictions  imposed 
by  the  use  of  proprietary  software.  The  recent  announcement  by  the  MathWorks  of 
the  “(i- Analysis  and  Synthesis  Toolbox”  for  MATLAB™  indicates  the  demand  for 
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user-friendly  software  to  calculate  p  [40].  This  new  toolbox  uses  a  variation  of  the 
power  method  proposed  by  Packard,  Fan,  and  Doyle  to  compute  a  lower  bound  for 
p(Ma )  [41].  This  optimization  searches  for  stationary  points  of  the  singular  values  of 
(DMaD~l)  with  respect  to  the  elements  of  D.  From  the  Principal  Direction  Align¬ 
ment  theory  of  Danir1,  Kouvaritakis,  and  Latchman,  such  stationary  points  have  been 
shown  to  coincide  with  stationary  points  of  p(MaUd )  with  respect  to  the  elements  of 
Ud  [31].  However,  like  all  known  lower  bound  expressions,  the  procedure  is  noncon- 
vex  so  candidate  solutions  for  p(Ma)  must  be  compared  to  the  singular  value  upper 
bound  to  see  if  they  actually  achieve  the  true  supremum.  Again,  rather  than  per¬ 
forming  a  singular  value  decomposition  on  the  n2  x  n 2  system  DMaD~l ,  the  test  may 
be  formulated  as 

p(Ma)  <a(R~lML~l)a(LPR)  (7.6) 

involving  ly  n  x  n  matrices  where  p(Ma)  represents  a  candidate  for  p(Ma)-  If  the 
inequality  of  Equation  7.6  holds  with  equality,  then  p(Ma)  =  p{Ma )■ 

If  no  candidate  for  p  is  found  for  which  the  lower  and  upper  bounds  become  equal, 
then  the  system  may  have  a  repeated  maximum  singular  value  at  the  “inf”  of  Equation 
5.1  for  scalar  uncertainties  or  Equation  5.9  for  block  structured  uncertainties.  Lack 
of  stationarity  at  this  upper  bound  implies  that  no  p  exists  for  which  the  upper 
and  lower  bounds  become  equal.  Chapter  8  contains  a  detailed  description  of  the 
difficulties  involved  with  computing  p  when  the  maximum  singular  value  repeats. 


CHAPTER  8 

CALCULATION  OF  THE  STRUCTURED  SINGULAR  VALUE  WITH 
REPEATED  MAXIMUM  SINGULAR  VALUES 

8.1  Cusping  Singular  Values 

The  work  presented  in  the  previous  chapters  generally  requires  that  the  maximum 
singular  value  not  repeat.  This  requirement  stems  from  the  fact  that  the  inequality 
in  the  right-hand  side  of 

p(Ma)  =  sup  p(MaUd)  <  inf  cr(DMaD~l) 
ud  d 

is  not  guaranteed  to  achieve  equality  at  the  “inf”  if  a  =  <r,-,  i  ^  1  because  this 
suggests  that  a  stationary  point  with  respect  to  D  has  not  been  found.  Following  the 
infimization  of  a(DMaD~1 )  with  respect  to  D ,  three  possible  conditions  can  occur. 
These  three  conditions  are  illustrated  in  Figures  8.1  through  8.3  with  corresponding 
Ma  matrices  in  Tables  8.1  through  8.3  respectively.  Figure  8.1  represents  a  system 
in  which  info a(DMaD~l)  is  achieved  with  distinct  a(DMaD~l).  From  the  MPDA 
property  reviewed  in  Chapter  4,  this  guarantees  the  simultaneous  determination  of 
sup Udp(MaUd)  and  hence  p,(Ma). 

Figure  8.2  represents  a  second  system  where  \niDv{DMaD~l)  occurs  with  a  = 
cr2  =  supUdp(MaUd)-  The  point  at  which  the  equalities  hold  will  be  referred  to  as  a 
“kiss”  and  its  calculation  requires  special  consideration. 

Figure  8.3  shows  a  system  in  which  inf/?  a(DMaD~l)  occurs  with 
<7  =  (72  ^  snp ud  p(MaUd)  =  p(Ma).  This  case  will  be  referred  to  as  a  “cusp”  and 
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Table  8.1:  Ma  Matrix  for  Figure  8.1. 

-0.89  +  2.04;  -4.34-5.31;  -1.83  +  4.69;  3.49-6.31;  3.07  +  5.92;' 

3.87  +  17.96;  1.66  -  4.72;  6.83  +  2.94;  -7.38  -  14.37;  -8.73  -  1.29; 

-7.68-  3.20;  -11.10-8.20;  6.84  -  20.10;  -3.15  +  0.62;  2.27  +  7.21; 

-5.11-11.27;  -2.81  +  14.23;  8.19  +  11.80;  8.89  +  14.60;  2.73  -  10.69; 

.  9.83  -  18.68;  -5.39  +  6.77;  1.11+0.19;  6.29  +  2.55;  -2.83  +  6.75; 


Table  8.2:  Ma  Matrix  for  Figure  8.2. 

19.18  +  0.37;  6.82  -  1.75;  3.13-0.95;  -4.92  +  1.11;  3.34  -4.59;' 

-0.20  -  3.07;  18.56  +  1.29;  -1.44  +  0.35;  3.22  +  2.37;  -1.32  +  3.15; 

6.42  +  1.85;  -0.70  +  1.03;  15.34-2.01;  -0.77-0.82;  -0.13  +  1.36; 
0.06  +  0.64;  -0.53  -2.47;  3.53  -0.97;  10.96-3.11;  3.93  -0.96; 

-2.39  -  5.34;  3.21  -  0.78;  3.74  +  1.38;  3.24  -  0.03;  15.41  -  0.33; 


Table  8.3:  Ma  Matrix  for  Figure  8.3. 

5.18  +  0.37;  6.82  -  1.75;  3.13-0.95;  -4.92  +  1.11;  3.34  -4.59;' 

-0.20  -3.07;  4.56  +  1.29;  -1.44  +  0.35;  3.22  +  2.37;  -1.32  +  3.15; 

6.42  +  1.85;  -0.70  +  1.03;  1.34-2.01;  -0.77-  0.82;  -0.13  +  1.36; 

0.06  +  0.64;  -0.53  -  2.47;  3.53  -  0.97;  -3.03-3.11;  3.93  -0 .96; 

.  ~2-39  ~  -*5-34j  3.21  -  0.78;  3.74  +  1.38;  3.24  -  0.03;  1.41  -  0.33; 
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calculating  p(Ma)  for  such  a  system  represents  a  quite  challenging  problem.  The 
remainder  of  this  chapter  will  address  several  approaches  that  greatly  improve  upon 
previous  techniques  for  determining  p{Ma)  when  a(DMaD~x )  repeats. 

As  previously  discussed,  noncusping  systems  such  as  that  of  Figure  8.1  are  guaran¬ 
teed  to  achieve  p,  at  info  a(DMaD~x).  Using  the  following  theorem  due  to  Latchman 
[27],  it  is  possible  to  compute  the  supremizing  Ud  of  supyd  p(MaUd)  given  the  infimiz- 
ing  D  of  in{Dv(DMaD~x). 

Theorem  8.1:  Given  the  optimal  diagonal  scaling  matrix  D  corresponding  to  a 
stationary  point,  there  exists  a  diagonal  unitary  matrix,  Ud  =  diag[eJ°l , . . . ,  ejffn],  0  < 
0{  <  2i r,  where  0{  =  (arg(j/,-i)  —  arg(a;,-i)j,  i  =  1, . . . ,  n,  such  that  the  principal  singular 
vectors  Xi  and  Y\  of  DMaD~xUd  are  aligned. 

Proof:  Recall  that  p(MaUd)  <  a(DMaD~1Ud)-  For  any  system,  singular  vectors 
Aj  and  Y\  obey  the  following  relationships: 

(DMab~x)Yi  =  aXi 

Xf{i)Mab-1)  =  aY”.  (8.1) 

Introducing  a  diagonal  unitary  matrix,  Ud,  allows  Equations  8.1  to  be  rewritten  as 

(. bMab-xUd)Uld{Yx  =  aXy 
X?{DMab-xUd)  =  oYxHUd.  (8.2) 

from  Equations  8.1  and  8.2  it  is  apparent  that  Ud  may  alter  the  phases  of  X-  and 
Y)  while  continuing  to  satisfy  the  MPDA  requirement  that  the  magnitudes  of  the 
corresponding  elements  of  Ai  and  Y\  be  equal.  The  independent  phase  alignment 
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of  the  elements  of  X\  and  Yi  provides  the  necessary  information  to  compute  the 

A 

supremizing  Ud  (denoted  as  Ud )•  ■ 

A 

Using  Theorem  8.1,  the  individual  diagonal  elements  of  Ud  are  found  to  be 

Ui  =  edarg^iij-arg^)]  _  ^ ?  *  =  1} . .  .  ?  n  (8.3) 

for  element-by-element  structured  uncertainties.  A  similar  expression  reflecting  indi- 

A 

vidual  uncertainty  blocks  applies  to  the  general  block  structured  case.  The  actual  Ud 
determined  in  this  manner  is  unique  within  some  scalar  multiplier  e^. 

Example  8.1:  Applying  the  relationship  of  Equation  8.3  with 

A  A 

D  =  diag[  1,  0.460,  0.350,  0.425,  0.541],  the  corresponding  Ud  for  the  system  of  Figure 
8.1  is 

Ud  =  diag[l,  0.996  -  ;0.078,  -0.998  +  ;0.060,  0.966  +  ;0.250,  0.921  -;0.390] 

with  a(DMaD~l)  =  p{MaUd)  =  36.94  =  p{Ma). 

The  ability  to  directly  compute  Ud  given  D  provides  a  simple  means  to  ensure  that 

A  A 

a(DMaD~l)  =  fi(Ma)  because  it  guarantees  that  sup^  p(MaUd)  =  inf Da(DMaD~l). 
This  becomes  increasingly  important  as  the  conditions  illustrated  in  Figure  8.2  are 
considered. 

8.2  Systems  with  infp g(DMaD~l)  =  a2(DMaD~1)  =  p(Ma) 

When  the  singular  values  become  equal  or  nearly  equal,  the  corresponding 
singular  vectors  become  unstable  with  small  changes  in  the  cr,vs  resulting  in  large 
variations  in  the  columns  of  .Y  and  Y  [42].  This  behavior  is  a  direct  result  of  the 
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expressions  defining  the  singular  vectors  of  a  matrix  A,  namely 

AAhX{  =  a  fXi 

AhAY{  =  o?Yi.  (8.4) 


When  the  singular  values  repeat,  the  singular  vectors  combine  to  span  a  q— dimensional 
subspace  where  q  represents  the  multiplicity  of  the  repeating  er,/4.  For  q  =  2,  (the 
most  common  case),  the  first  line  of  Equation  8.4  becomes 


AAHXl  -  <t\Xx 
AAhX2  =  <r\X 2 


(8.5) 


with  the  resulting  subspace  formed  by  the  linear  combination  of  and  X2  to  give 

AAH[(l  +  jS)Xi  +  (a  +  jfi)X,)  =  <r?((7  +  ]S)Xt  +  (a  +  }fi)X,].  (8.6) 

(Note  that  Yi  is  dependent  on  X{  through  the  relationship  AHX{  =  (XiY{  and  thus  a 
similar  development  for  the  subspaces  spanned  by  Y\  and  Y2  would  be  redundant). 
From  Equation  8.6  it  is  apparent  that  any  linear  combination  of  X\  and  X2  satisfies 
the  definition  of  a  singular  vector.  This  arbitrariness  poses  a  significant  difficulty  for 
algorithms  relying  on  the  vector  alignment  properties  of  MPDA  theory  because  there 
is  no  longer  any  guarantee  that  vectors  chosen  arbitrarily  from  the  q— dimensional 
subspace  will  satisfy  the  alignment  requirements. 

For  example,  the  system  of  Figure  8.2  has  inf# ^(DMaD-1)  =  24.14.  Using  the 
method  of  Theorem  8.1  for  Xi,  Y\  and  X2,  Y2  gives  corresponding  values  of  p(MaUd) 


as  follows: 
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Xu  Yx  -  p(MaUd)  =  23.12  n(Ma) 

X2,  Y7  ->  p{MaUd)  =  20.70  ^  p(Ma) 

Both  the  first  and  second  principal  directions  must  be  considered  in  this  case  since 
both  represent  legitimate  solutions  to  Equation  8.5.  The  actual  vectors  from  the 
q— dimensional  subspace  that  do  provide  the  supremizing  Ud  may  be  found  by  selecting 
a  vector  Xnew  defined  as 

Xnetv  =  (7  +  }&)X\  +  (a  +  ]fi)X2 

as  in  Equation  8.6.  Of  course  Xnew  is  a  singular  vector  of  ( DMaD~l )  and  therefore 
any  scalar  multiple  of  Xnexu  is  also  a  singular  vector  of  (DMaD~l).  This  allows  the 
complex  multiplier  (7  +  j8)  of  X\  to  be  eliminated  with  no  loss  of  generality  leaving 
only  a  and  ft  as  unknowns.  Vector  Xnew  and  the  corresponding  vector  Ynexu  may  now 
be  written  as 


Xnew  =  Xi  +  (a  +  j/3)X2 
„  ( bMaD-')HXneu 

*ncw  —  _ 


and  a  2— dimensional  optimization  performed  over  a  and  /?  to  achieve  the  MPDA 
conditions  of  aligning  \Xnexu\  and  |T„e«;|.  Assigning  the  complex  multiplier  (a -f  j/3) 
to  X2  with  initial  conditions  of  <x  =  j3  =  0  allows  this  procedure  to  be  implemented 
very  efficiently  even  when  a  ^  02-  This  assignment  serves  to  initially  weight  X\  more 
heavily  for  cases  with  <7  >  <72  while  still  providing  for  subspace  combinations  if  a  and 


<72  coalesce. 
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Corollary  8.1:  For  systems  with  repeated  maximum  singular  value  of  multiplicity 
q  with 

mia(DMaD-1)  =  a2(DMaD~l )  =  •••  =  a^DMaD'1)  =  fi(Ma ), 
the  following  condition  must  hold: 

KMa)  =  p(MaUd)  I  |Xneu;(af)|-|Ke«;(«.)l  =  0  •  =  1, . . . , 2(«  -  1).  (8.8) 

Proof:  From  the  statement  of  the  Corollary  it  must  be  possible  to  establish  MPDA 
because  a(DMaD~l)  ^  p(Ma)  otherwise.  This  requires  that  some  linear  combination 
of  the  first  q  columns  of  X  and  Y  respectively  must  attain  alignment.  When  alignment 
is  achieved,  the  equality  of  Equation  8.8  must  hold.  ■ 

The  resulting  ^-dimensional  optimization  requires  only  vector  operations  to  align 
Xnew  and  Ynew  Although  this  function  is  not  convex,  it  requires  only  2(q  -  l)  opti¬ 
mization  variables  regardless  of  the  size  of  the  system  and  thus  proceeds  very  quickly. 
Example  8.2:  Using  the  system  of  Figure  and  Table  8.2,  the  solution  to  Equation 

A 

8.8  occurs  at  a  =  —0.0009,  and  /?  =  0.0921.  The  corresponding  optimal  Ud  is 

Ud  =  diag[  1,  0.91+;0.43,  1.0+j0.01,  -.94-;0.33,  0.41+j0.91] 

with  p(MaUd )  =  a{DMaD =  p(Ma)  =  24.14. 

The  values  of  a  and  8  are  not  robust  as  small  changes  in  D  cause  the  singular 
vectors  to  range  throughout  the  2-dimensional  subspace  spanned  by  X\  and  X2.  This 
is  important  considering  that  algorithms  used  to  solve  infp  a(DMaD~l )  depend  on  the 
analytic  derivative,  Jj|,  as  developed  in  Section  5.5.  Since  the  gradient  is  a  function 
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of  X\  and  Y\,  as  cr  approaches  ai  the  singular  vectors,  and  hence  the  gradient,  begin 
to  change  discontinuously.  This  makes  it  extremely  difficult  to  determine  whether  a 
candidate  solution  for  n  using  only  the  gradient  at  inf#  a(DMaD~l)  is  a  true  cusp 
or  simply  a  kiss.  Through  the  use  of  Equation  8.8  and  the  method  of  Theorem  8.1 
however,  the  system  of  Figure  8.2  is  shown  to  be  a  kiss. 

8.3  Systems  with  mfp  a(DMaD~l)  =  <72( DMaD -1)  ^  / u(Ma) 

For  the  case  of  a  true  cusp  as  depicted  in  Figure  8.3,  it  is  not  possible  to 
rotate  \Xnew\  Into  \Ynew\  because  this  condition  only  occurs  if  MPDA  can  be  attained. 
Although  no  theoretical  bound  on  the  gap  between  inf#  a(DMaD~l)  and  n(Ma)  has 
been  determined,  experimental  tests  on  cusping  systems  have  revealed  disagreements 
of  more  than  14%  [43].  Determining  the  actual  value  of  n  previously  required  solving 
for  sup Udp{MaUd)  over  n  —  1  free  variables  where  n  denotes  the  size  of  expanded 
matrix  Ma.  Since  this  computationally  intensive  optimization  is  nonconvex  the  lower- 
bound  by  itself  is  not  very  useful  since  it  is  difficult  to  determine  which  of  the  local 
maxima  is  actually  /z.  Therefore,  the  upper  bound  of  a{DMab~l )  is  generally  chosen 
as  an  estimate  for  /z(Ma)  even  though  this  leads  to  an  unnecessarily  conservative 
control  system  . 

Using  a  method  similar  to  that  of  the  previous  section,  it  may  still  be  possible 
to  find  an  Xnew  that  produces  a  Ud  (denoted  Unew )  “close”  to  the  supremizing  Ud- 
This  possibility  arises  from  the  fact  that  while  vectors  Xi,...,Xq  may  individually 
change  greatly  with  small  variations  in  ( DMaD -1),  the  actual  subspace  spanned 
by  linear  combinations  of  Xi, . . . ,  Xq  remains  continuous  [44,  45],  Combining  this 
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important  property  with  the  following  lemma  leads  to  an  extremely  efficient  method 
for  approximating  fi(Ma)  for  cusping  systems. 

Lemma  8.1:  For  any  A  €  CnXn  composed  of  elements  a,-j,  i,j  =  1, . . . ,  n, 

a  lim^a(/l)  =  for  fixed  i.  (8.9) 

Proof:  The  singular  values  of  A  are  the  square  roots  of  the  eigenvalues  of  AH A.  As 
an  element  along  the  diagonal  of  A  increases  or  decreases  towards  ±oo,  the  remaining 
elements  of  A  become  insignificant  in  comparison.  At  this  point  matrix  product  AHA 
may  be  approximated  by  aUauEu  making  the  maximum  singular  value  equal  to 

M*  ■ 

Lemma  8.1  simply  shows  that  by  perturbing  a  diagonal  element  of  a  cusping 
system,  the  multiplicity  of  a  must  eventually  disappear.  In  fact,  combinations  of  the 
diagonal  elements  may  be  perturbed  providing  a  number  of  different  ways  to  break  up 
a  cusp.  This  shift  eliminates  the  cusp  and  allows  the  establishment  of  MPDA  for  the 
shifted  matrix  (denoted  Ma)  guaranteeing  that  inf/? v(DMaD~l)  =  n{Ma).  MPDA 

A  A 

conditions  also  provide  a  direct  method  for  computing  U<i  given  D  through  the  use  of 
Theorem  8.1.  Modifying  this  method  to  use  Xnew  and  Ynew  from  Equation  8.7  leads 
to  the  following  Proposition. 

Proposition  8.1:  Given  some  Ma  €  CnXn  for  which 

inf  a(DMaD~l)  =  a2(DMaD~l)  =  ■■■=  <yq{DMaD~l)  #  fx(Ma), 
the  solution  over  only  2(q  —  1)  free  variables  of 
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sup p(MaUnew)  i  =  1,..., 2(^-1)  (8.10) 

at- 

is  a  close  approximation  of  n{Ma). 

Discussion:  The  continuity  of  the  spatxning  subspace  X\  •  •  •  Xq  implies  that  small 
perturbations  in  the  matrix  elements  of  Ma  will  result  in  correspondingly  small  pertur¬ 
bations  in  the  subspace.  As  a  scalar  indication  of  the  distance  between  two  matrices 
A,B€  Cnxn,  define  an  error  function  E  =  \\A  -  B\\2  [42].  Shifting  the  diagonal 
elements  of  the  cusping  system  in  Table  8.3  until  the  cusp  disappears  produces  a  new 
system  with  desirable  MPDA  properties.  If  this  new  system  is  sufficiently  close  to 
the  original  cusping  system  then  Xnew  of  the  two  systems  should  be  similar. 

Examination  of  Tables  8.2  and  8.3  reveals  that  the  two  systems  are  identical 
except  for  an  additive  shift  of  +14  to  all  the  diagonal  elements  of  Table  8.2  so  that 
Ma  =  Ma  +  14  •  I5.  Denote  Ma  and  the  infimizing  D  of  Table  8.2  as  Ma2  and  D2 
respectively.  Similarly  denote  the  corresponding  values  of  Table  8.3  as  Ma3  and  Z)3. 
Then 

E  =  \\bzMa3D?  -  D2Ma2D^\\  =  16.44. 

Since  a{DzMa3D2  )  =  13.11  from  Figure  8.3,  E  indicates  that  the  two  systems  are 
not  particularly  close. 

However,  an  alternative  shift  of  only  element  Ma(  1,1)  by  +4  also  breaks  up  the 
cusp.  Figure  8.4  and  Table  8.4  show  this  new  system  with 


fi(Ma)  =  aib^Ma.D^)  =  15.182. 


Figure  8.4:  System  from  Figure  8.3  with  Shifted  Ma(  1, 1)  Element. 


Table  8.4:  Ma  Matrix  for  Figure  8.4. 

9.18  +  0.37;  6.82  -  1.75;  3.13-0.95;  -4.92  +  1.11;  3.34  -4.59; 

0.20  -3.07;  4.56  +  1.29;  -1.44  +  0.35;  3.22  +  2.37;  -1.32  +  3.15; 

6.42+  1.85;  -0.70  +  1.03;  1.34-2.01;  -0.77  -0.82;  -0.13  +  1.36; 

0.06  +  0.64;  -0.53  -2.47;  3.53  -  0.97;  -3.03-3.11;  3.93  -0.96; 
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Error  function  E  then  becomes 

E  =  \\D3Ma3b^  -  DAMa,D^\\  =  4.77 

indicating  that  System  4  is  much  closer  to  System  3  than  System  2  was.  Since 

A 

the  spanning  subspace  is  continuous,  it  seems  reasonable  for  the  optimum  Xnew  = 
X\  +  (di  +  jctijXz  of  the  cusping  system  to  be  similar  to  that  of  Table  8.4  for  which 

A 

MPDA  is  established.  Table  8.5  shows  the  variation  of  Xnew  as  the  cusping  system 
of  Table  8.3  (Shift=0)  is  shifted  to  the  noncusping  system  of  Table  8.4  (Shift=4). 


Table  8.5:  Variation  of  Xnew  due  to  Shift  of  Table  8.3  System. 


Shift  of  Af.(l,  1) 

+0 

+1 

+2 

+3 

+4 

1 

-0.053-  3  0.237 
0.425+  j  0.210 
0.300+  3  0.240 
-0.027-  3  0.345 

1 

-0.070  -  3  0.248 
0.424  +  j  0.166 
0.309  +  ;  0.246 
-0.058  -  j  0.352 

1 

-0.080  -  3  0.254 
0.409  +  3  0.135 
0.314  +  3  0.244 
-0.075  -  3  0.355 

1 

-0.089  -  3  0.258 
0.388  +  3  0.112 
0.315  +  3  0.241 
-0.087  -  3  0.354 

1 

-0.10  -  3  0.26 
0.37  +  3  0.09 
0.31  +  3  0-24 
-0.09  -  3  0.35 

Since  shifting  other  combinations  of  matrix  elements  will  also  break  up  the  cusp, 
it  may  certainly  be  possible  to  find  noncusping  systems  for  which  E  is  less  than  the 
4.77  achieved  by  System  4.  Regardless  of  the  shifting  combination  used  to  break  up 
a  cusp,  the  2 (q  —  l)-dimensional  subspace  spanned  by  Xi, . . . ,  Xq  must  vary  contin¬ 
uously  from  the  cusping  condition  to  any  of  the  noncusping  conditions.  This  implies 

A 

that  the  spanning  subspace  is  sufficiently  well-behaved  to  provide  an  Xnew  from  Equa¬ 
tion  8.7  and  corresponding  Unew  that  closely  approximates  Ua  of  p^MJJa)  —  //.(Mc). 
Although  no  bounds  have  been  placed  on  t  he  difference  between  sup0.  p(MaUnew )  and 
sup Ud  p{MaUd),  numerical  experience  suggests  that  this  difference  is  quite  small.  In 
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fact,  the  largest  deviation  between  this  estimate  and  p  found  to  date  is  less  than 
0.34%.  ■ 

Applying  Equation  8.10  to  the  cusping  system  of  Table  8.3  gives 

Unew  =  diag[  1,  0.75  -;0.67,  -.48  +  j0.88,  -1.00 -;0.02,  0.78  —  j0.63] 

with  ^  =  -.23,  a2  =  -0.11  and  p(MaUnew )  =  12.768. 

To  determine  whether  Unew  is  actually  a  candidate  for  /*,  the  derivative  dp/d0{  for 

A 

i  —  1,. . .  ,n  at  Unew  may  be  examined  for  its  proximity  to  the  0  vector.  In  this  case 
it  is  found  to  be 

0.24 
0.10 
-0.19 
0.31 
0.17 

indicating  that  Unew  is  indeed  almost”  a  stationary  point.  Using  Unew  as  an  initial 
condition,  a  direct  supremization  over  all  Ud  returns  a  value  of 

Ud  =  diag[l,  0.59-;0.80,  -.53+;0.85,  -.99-;0.11,  0.66-;0.75]  (8.11) 

with  p(MaUd )  =  p{Ma)  =  12.81.  The  difference  between  p{MaUncw )  and  p(MaUd)  in 
this  case  is  0.33%. 

This  particular  example  was  chosen  because  it  produced  the  largest  percent  dif¬ 
ference  between  p{MaUnew )  and  p(Ma)  out  of  over  100  cusping  systems  tested.  The 
next  largest  percent  difference  was  10  times  smaller  than  this  case:  about  0.03%. 


dp_ 

ddi 
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While  larger  differences  are  certainly  possible,  the  fact  that  in  the  majority  of  ex- 

A  A 

amples  investigated,  p(MaUnetu )  and  p(MaUd)  agreed  within  four  significant  figures 
is  most  encouraging.  Table  8.6  on  page  114  contains  a  comparison  between  p(Ma) 
and  p(MaUnew )  for  several  cusping  systems.  (This  comparison  will  be  discussed  more 
completely  in  Section  8.8.) 


8.4  Effects  of  Nonconvexity 

It  must  be  emphasized  that  both  the  2 (q  —  l)-dimensional  search  over  the  a,- 
and  the  n  —  1  dimensional  search  over  Ud  are  nonconvex.  Algorithms  designed  to 
find  the  global  maximum  must  employ  some  procedure  to  search  the  domains  of  Ud 
or  Unew  respectively.  Therefore,  the  advantages  of  the  2 (q  -  l)-dimensional  search 
become  more  profound  because  the  domain  of  Unew  does  not  increase  with  system 
size. 

Although  repeated  singular  values  with  multiplicity  q  =  n  are  theoretically  possi¬ 
ble,  by  far  the  more  common  case  is  for  q  =  2.  For  such  systems,  an  additional  bonus 
of  a  2 (q  —  1)  =  2— dimensional  search  is  that  3— dimensional  mesh  plots  of  p(MaUd ) 
vs  and  a2  may  be  generated  allowing  all  maxima  to  be  examined.  This  effectively 
eliminates  the  difficulties  of  multiple  maxima  because  all  candidate  points  for  p  can 
easily  be  examined. 

Such  a  mesh  plot  for  the  system  of  Figure  8.3  is  presented  in  Figure  8.5.  In 
this  plot,  a\  and  a2  range  from  —2  to  2.  Two  pronounced  local  maxima  are  readily 
apparent:  one  corresponding  to  the  value  of  p(MaUnew)  =  12.768  and  the  other  at 
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p(MaUnew )  =  12.741  where 

Unew  =  diag[ l,  -.13  +  j0.99,  1.00+j0.084,  -.91  +^0.42,  -0.81+;0.58] 

with  cti  =  0.25  and  a i  —  0.54.  Using  Un«w  as  a  starting  point,  a  search  over  all  Ud 
results  in  a  local  stationary  point  at 

Ud  =  diag[  1,  -.17  4-  j0.98,  0.95  +  j0.32,  -.94  +  ^0.33,  -.92  +  j0.40] 
with  corresponding  p(MaUd)  =  12.76  ^  P-{Ma)  =  12.81. 

8.5  Range  of  a,- 

The  ability  to  c-\:ily  select  local  maxima  of  p{MaUnew)  from  a  mesh  plot  as 
candidates  for  p.  is  a  tremendous  improvement  over  previous  methods  which  require 
optimizing  p{MaUd)  over  as  many  as  n  —  1  variables,  especially  as  the  system  size 
increases.  The  range  of  an-’  >o  used  in  Figure  8.4  appears  to  adequately  cover 
the  subspace  spanned  by  X\  and  X2  allowing  U„ew  to  be  found.  As  and  c*2 
move  away  from  the  local  maxima,  the  magnitude  of  p  decreases  suggesting,  (but  not 
guaranteeing),  that  no  additional  points  of  interest  exist  outside  of  the  plotted  region. 
It  is  important  to  determine  the  maximum  range  of  the  a;  because  if  the  bounds  can 
only  be  placed  at  ±co,  then  a  mesh  plot  or  any  other  search  method  would  clearly 
be  impractical. 

For  the  case  of  multiplicity  q  =  2,  determining  limits  on  <*1  and  0"2  requires  ex¬ 
amination  of  Equation  8.7.  Placing  the  complex  multiplier  (oi  -j-  ja 2)  on  X2  makes 
the  worst  case  condition  when  the  optimum  XnCw  —  Xi  so  that  X\  makes  no  con¬ 
tribution.  Such  a  condition  implies  that  c*i  =  00  is  required  to  produce  the  desired 
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A 

Xnew  =  X2.  However,  since  X\  and  X2  are  singular  vectors  they  may  be  normalized 
such  that  ||Xi|j  =  ||AT2||  =  1-  Since  the  2— norm  of  a  vector  is  greater  than  or  equal  to 
any  element  of  that  vector,  no  element  of  X\  or  X2  can  be  greater  than  1.  Therefore, 
in  forming  Xnew  =  X\  +  (cci  +  ]ct2)X2,  »i  and/or  a2  values  much  greater  than  1  will 
essentially  eliminate  any  contribution  of  X\  so  Xnew  quickly  reaches  its  asymptotic 
value  of  X2.  In  the  rare  case  that  such  a  worst  case  condition  occurs,  values  of  c*i  or 
a2  of  20  or  less  should  be  more  than  adequate  to  produce  the  optimum  Unew.  In  prac¬ 
tice,  large  values  of  o^  and/or  a2  can  be  eliminated  by  merely  shifting  the  complex 
multiplier  from  X2  to  X\  so  that 

Knew  —  (<*1  +  ]<*2)Xi  +  X2 

For  Xnew  &  X2,  the  condition  of  ot\  =  a2  =  0  then  prevents  any  contribution  from 
AV 


The  ability  to  compute  candidates  for  [X  using  only  2 (q  —  1)  variables  provides  an 
extremely  practical  means  for  dealing  with  cusping  singular  values.  After  determining 
suptti  p(MaUnew)  the  result  may  compared  to  the  singular  value  upper  bound  and  if 
the  conservatism  gap  does  not  exceed  desired  tolerances  no  further  optimization  is 
required.  The  conservative  upper  bound  is  simply  chosen  as  an  estimate  for  p  and 
the  frequency  sweep  over  the  desired  range  continues.  In  the  event  that  gap  between 
the  upper  and  lower  bounds  exceeds  the  termination  requirements,  the  supremizing 

A 

matrix  Uncw  offers  an  improved  initial  condition  for  a  search  over  the  full  set  (n  —  1)  of 


103 


optimization  variables.  Unfortunately,  the  computational  aspects  of  this  additional 
search  are  potentially  overwhelming:  especially  as  the  system  size  increases.  Rather 
than  continuing  the  search  for  /x  using  eigenvalue  calculations,  it  will  be  shown  that  a 
simple  transformation  of  Une w  allows  the  optimization  to  proceed  using  singular  value 
decompositions.  Because  the  singular  values  are  the  square  roots  of  the  eigenvalues 
of  positive  definite,  symmetric  matrices,  ( AH A  or  AAH)>  the  singular  value  decompo¬ 
sition  offers  significant  improvements  in  computing  efficiency  over  general  eigenvalue 
decompositions.  For  example,  using  the  MATLAB™  “eig”  and  “svd”  functions  on 
10  random,  complex  9x9  matrices,  the  average  flop  counts  are  as  follows: 

eig  — >  53,606  flops 
svd— ►  38,448  flops. 

As  the  system  size  increases,  this  difference  becomes  even  more  significant  indicat¬ 
ing  the  obvious  computational  advantage  of  the  singular  value  decomposition  versus 
the  eigenvalue  decomposition.  It  must  also  be  noted  that  the  flop  counts  above  in¬ 
clude  calculation  of  the  singular  vectors  while  the  eigenvectors  were  not  similarly 
determined. 

The  MPDA  property  of  Chapter  4  shows  that  the  unique  stationary  point  as¬ 
sociated  with  info  a{DMaD~l)  corresponds  to  a  stationary  point  of  sup Udp(MaUd)- 
When  <7  repeats  there  may  be  no  stationary  point  of  a  so  the  direct  relationship  be¬ 
tween  <r  and  /x  is  in  general  lost  (the  “kiss”  condition  is  an  exception).  However,  using 
the  principal  direction  alignment  (PDA)  property  developed  by  Daniel,  Kouvaritakis 
and  Latchman  ,  a  similar  relationship  is  shown  to  exist  between  cr{(DMaD~l)  and 


p(MaUd )  where  the  <rt-  represents  some  singular  value  other  than  the  maximum  [31]. 
Development  of  the  PDA  property  starts  with  the  following  lemma  describing  the 
weil  known  eigenvalue  shift  property. 

Lemma  8.2:  For  any  matrix  A  G  CnXn  and  scalar  r 

A,(A  +  r/)  =  A,(A)  +  r 

Proof:  Denote  the  eigenvalues  of  matrix  A  as  A  and  of  matrix  (A  +  rl )  as  A.  The 
respective  eigenvalues  are  simply  the  solutions  to 

det{\I  —  A}  =  0 

and 

det{\I  —  A  —  rl}  =  0. 

Now  define  (A  -  r)  =  A  and  rewrite  the  second  equation  as 

det{XI  —  A}  =  0. 

Since  the  eigenvalues  of  a  matrix  are  the  roots  of  its  characteristic  equation  they 

»  A  A 

must  be  unique;  therefore  A  =  A.  Substituting  back  for  A  gives  A  =  A  +  r  and  thus 
completes  the  proof.  ■ 

While  the  eigenvalues  of  a  matrix  obey  the  shift  property,  the  singular  values  do 
not.  As  in  Section  8.3,  this  provides  a  means  of  breaking  up  the  repeated  singular  val¬ 
ues  by  introducing  a  sufficiently  large  r.  The  spectral  radius/singular  value  inequality 
then  becomes 
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Equation  8.12  requires  that  Ai  (MaUd)  corresponding  to  p(MaUd )  be  real.  Such  a 
requirement  is  not  a  limitation,  however,  since  Ud  may  be  rotated  by  a  scalar  multiplier 
e ^  without  affecting  the  magnitude  of  p(MaUd ).  Rewriting  Equation  8.12  into  a 
structured  singular  value  relationship  results  in 

p(Ma)  =  sup  p(MaUd)  =  sup  p(MaUd  +rl)  —  r  =  sup  inf  c(DMaUdD~l  +  rl)  -  r. 

Ud  Ud  ud  D 

(8.13) 

Unfortunately,  the  convexity  properties  that  made  the  optimization  info  a(DMaD~l) 
so  attractive  originally  are  now  lost  because  the  right-hand  side  of  Equation  8.13  is 
no  longer  invariant  to  Ud-  Still,  the  addition  of  scalar  r  offers  an  alternative  means 
of  establishing  MPDA  for  cusping  systems.  The  following  theorem  is  originally  due 
to  Latchman  [27]. 

A  A 

Theorem  8.2:  Let  a( DMaUdD -1  +r/)  have  stationary  points  at  Ud  and  D  respec- 

A  A  A 

tively.  If  r  is  large  enough  to  ensure  that  a(DMaUdD~l  +  rl )  is  simple,  then  MPDA 
is  established  for  the  matrix  ( DMaUdD~l  +rI)J  where  J  is  a  matrix  containing  mily 
±1  along  the  diagonal  and  zeros  elsewhere. 

Proof:  The  stationary  points  of  supj/d  inf oa(DMaUdD~l  +  rl)  occur  when 

~[u{DMaUdD-1  +  rl)}  =  0  (8.14) 

ud{ 

and 

■jlplDMMD-'  +  rl) )  =  0.  (8.15) 

The  stationary  points  of  Equation  8.14  have  already  been  shown  to  occur  upon  es¬ 
tablishment  of  MPDA  for  the  matrix  (DMaUdD"1  +rl).  This  condition  applies  only 
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when 

|Aiil  =  IKi|  (8.16) 

for  all  i.  Differentiation  of  Equation  8.15  follows  in  a  manner  similar  to  that  of  Section 
5.5  so  that  the  stationary  points  are  characterized  by 

£{rtHEsXt  -  X?EiiYt)  =  0 

or 

Im(X?EuYi)  =  0  (8.17) 

for  all  i.  The  requirement  of  Equation  8.17  may  be  stated  in  terms  of  the  arguments 
of  the  individual  vector  elements  such  that 

arg(Ara)  =  arg(yh)  +  m,-7 r  (8.18) 

for  all  i  with  integer  m,-.  Applying  Equations  8.16  and  8.18  requires  that 

X:  =  JYX 

and  completes  the  proof.  ■ 

Progressing  from  this  point  to  MPDA  therefore  requires  that  J  —  I  for  all  cases  in 
which  both  Equations  8.14  and  8.15  are  satisfied.  The  following  theorem,  originally 
due  to  Latchman  with  simplifications  by  Young  [27,  46]  shows  that  J  must  equal  I. 

Theorem  8.3:  Let  a{DMaUdD~l  +  rl)  have  stationary  points  at  Ud  and  D  respec¬ 
tively.  If  r  is  large  enough  to  ensure  that  c{DMaUdb~l  +rl )  is  simple,  then  MPDA 
is  established  for  the  matrix  ( DMaUdb~l  +  rl). 
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Proof:  As  mentioned  above,  this  proof  reduces  to  showing  that  J  =  I  at  all 
stationary  points  of  Equation  8.13.  Matrix  ( DMaUdD~l  +  rI)J  has  been  shown  to 
satisfy  the  MPDA  requirement.  Therefore 

p(DMaUdD -1  +  rl)  <  a{[bMaUdb-x  +  rI]J). 

Multiplying  by  and  AT  gives 

X? {bMaUdb~x  +  r/)A,  <  X?{bMaUdb~x  +rI)JX, 

which  may  be  simplified  as 

rX?(I  -  J)Xi  <  XlHbMab-1Ud{J  -  I)Xi.  (8.19) 

Clearly  as  r  increases,  the  left-hand  side  of  Equation  8.19  increases  while  the  right- 
hand  side  remains  unchanged.  The  inequality  would  therefore  be  contradicted  unless 
J  —  I  which  completes  the  proof.  ■ 

Theorem  8.2  reveals  that  for  r  large  enough  to  separate  a  and 

sup  p(MaUd)  =  sup  inf  atDMaUdD'1  +  rl)  -  r. 

Vi  Ud  D 

More  importantly,  the  following  lemma  serves  to  relate  stationary  points  of 
supUdMDa(DMaUdD-1  +  rl)  to  those  of  cri(DMaUdD~1)  where  the  ov4  represent 
singular  values  with  magnitudes  less  than  a.  This  new  formulation  is  invariant  to  Ud 
and  no  longer  requires  the  shift  by  r. 

A  A 

Lemma  8.3:  For  D  and  Ud  defined  as  optimizing  solutions  to  Equation  8.13,  denote 
AT  as  the  left  major  singular  vector  of  (DMaUdb~l  +rl).  Then  Ai  and  its  correspond¬ 
ing  conjugate  transpose,  X^ ,  are  also  right  and  left  eigenvectors  of  ( DMaUdb~l ).  In 
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AAA 

addition,  matrix  ( DMaUdD~l )  has  some  right  singular  vector  which  is  aligned  with 
its  corresponding  left  singular  vector.  This  alignment  between  nonmajor  singular 
vectors  will  be  referred  to  as  Principal  Direction  Alignment  [31]. 

Proof:  From  Theorem  8.2  it  is  known  that  ( DMaUdD~l  +  rl)  has  MPDA  and 
therefore  X\  and  X(f  form  a  left  and  right  eigenvector  pair  for  (DMaUdD~l  +  rJ). 
From  the  eigenvalue  shift  property  of  Lemma  8.2,  X\  and  Xf1  must  form  a  left  and 
right  eigenvector  pair  for  ( DMaU<iD~l )  as  well.  This  allows  the  eigenvalue  decompo- 

A  A  A 

sition  of  ( DMaUdD _1)  to  be  written  as 


DMaUdD~x  = 


■ 

1 

Ai 

0 

x? 

Xi 

w 

0 

A 

VT 

where  A,  W  and  VT  include  all  the  eigenvalues,  and  right  and  left  eigenvectors  except 
Aj ,  X\  and  X{{  respectively.  Inspection  of  the  hermetian  products 


(DMaUdD-'f  {DMaUdD-1) 


and 

{DMaUdb-l){bMaUdb-x)H 

as  in  Equation  8.4,  reveals  that  X\  is  both  a  right  and  corresponding  left  singu¬ 
lar  vector  associated  with  some  a^DMaUdD-1).  Therefore,  ( DMaUdb~l )  has  PDA 
concluding  the  proof.  ■ 

Combining  the  results  from  Theorem  8.2  and  Lemma  8.3  leads  to  the  following 
theorem  concerning  the  direct  relationship  between  sup^  p{MaUd)  and 
a{{DMaUdb-1). 
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4  A  A 

Theorem  8.4:  For  a\--  matrix  j*  ■  L ,  there  always  exists  an  optimal  Ud  and  D  such 
that  ( DMaUdb~l )  achie  -js  PO.v  and  the  corresponding  singular  value  is  a  stationary 
point  with  respect  to  D.  Also,  under  these  circumstances  the  following  condition 
arises: 

sup p(MaUd)  =  max  j ai(DMaD~l)  |  A[ ai{DMaUdD -1)]  =  o|  (8.20) 
for  ij  =  1 

A  A 

Proof:  Theorem  8.2  guarantees  the  existence  of  an  optimizing  Ud  and  D  such  that 
( DMaUdb~l )  satisfies  MPDA.  The  shift  property  may  then  be  employed  giving 

p(MaUd)  =  <r(bMaUdb -1  +rl)-r  =  <7I■(Z)MaX)-,). 

Theorem  8.3  shows  that  for  optimal  D  and  Ud  matrix  ( DMaUdb~l )  will  have  PDA. 
For  the  case  of  i  =  1,  a  does  not  cusp  and  the  MPDA  property  is  achieved.  When 
i  >  1,  achieving  PDA  for  {bMaJJdb~l)  requires  one  of  the  singular  vector  pairs  to  be 
aligned  indicating  that  ai(DMab~l)  is  a  stationary  point  with  respect  to  D.  Selecting 
the  largest  such  stationary  point  completes  the  proof.  ■ 

Example  8.3:  This  example  illustrates  these  results  using  the  system  of  Table  8.3. 
The  supremizing  Ud  is  the  same  as  that  presented  in  Equation  8.11.  Infimizing  the 
right-hand  side  of  Equation  8.20  over  D  with  r  >  1.87  gives 

D  =  diag  [1,  0.749,  1.106,  1.225,  1.218] 

and 


fi(Ma)  =  p{MaUd )  =  a{bMaUdb~x  +  rl)  —  r  =  cr2(bMab~x)  =  12.81. 


no 


The  relationship  of  Equation  8.20  allows  the  singular  value  decomposition  to  be 
retained  in  the  computation  of  p  even  for  repeated  a.  Unfortunately  the  stationarity 
condition  on  <x ,■  no  longer  restricts  the  search  for  p  to  minima  or  maxima  but  to  inflec¬ 
tion  points  as  well.  This  introduces  additional  complexity  to  an  already  nonconvex 
function  as  the  search  criteria  now  includes  sorting  through  all  stationary  points.  If, 
however,  the  search  over  the  a,-  produces  a  value  of  p(MaUne w)  close  to  p(Ma),  then 
an  optimization  based  on  the  right-hand  side  of  Equation  8.20  may  offer  tremendous 
computational  advantages  over  the  left-hand  side  of  the  equation. 

A  A 

8.7  Direct  Calculation  of  D  from  Ud 

A  A 

Section  8.6  introduced  a  method  for  directly  calculating  Ud  given  D  such  that 
a{DMab~l)  =  p(MaUd)  =  p(Ma)-  Using  the  2 (q  -  l)-dimensional  search  over  a,-, 
a  candidate  for  Ud  denoted  Unew  may  be  calculated.  In  order  for  this  estimate  to 
serve  as  a  useful  initial  guess  for  the  right-hand  side  of  Equation  8.20,  some  means  of 
transforming  Unew  into  a  corresponding  Dnew  must  be  found.  Such  a  transformation 
was  originally  presented  by  Fan  and  Tits  as  a  means  of  verifying  that  the  solution 
to  their  nonconvex  vector  optimization  method  equals  p  [38].  By  modifying  their 
original  theorem  and  proof  pertaining  to  this  transformation  it  is  possible  to  directly 
determine  D  corresponding  to  Ud  such  that  ai{DMab~ x)  =  p(MaUd)  even  when  a 
repeats.  Equation  7.2  is  repeated  here  for  convenience. 

p(Ma)  =  max{||Af0a:||2 1  ||?iz||3||Af«a;||2  =  ||7W0z||2,  i  =  (8.21) 

x£Cn 


Ill 


The  following  proposition  and  subsequent  theorem  are  originally  due  to  Fan  and 
Tits  with  modifications  to  account  for  systems  with  repeated  singular  values  [38]. 
Proposition  8.2:  Suppose  x  €  Cn  is  a  solution  to  Equation  8.21.  Then  there  exists 

A 

a  vector  A,-,  i  =  1, . . . ,  m  such  that  the  following  equation  holds: 

M?Max  +  £  A,-  [M"V{Max  -  \\V{x\\2M?  Max  -  \\Max\\lVix\  =  0  (8.22) 

«=i 

where  the  V-x  are  projection  matrices  as  in  Chapter  7  and  the  A,-'*  are  multipliers  with 
At-  >  0,  i  =  1,. . .  ,n. 

Theorem  8.5:  Let  A  =  [a\  , . . . ,  An]  be  the  unique  multiplier  vector  corresponding 
to  the  optimizing  x.  Then  D  =  diag  Ja\  2 , . . . ,  An2  is  the  solution  to  Equation  8.20. 
Proof:  From  Chapter  7,  some  Yx  must  exist  such  that 

b-'MUb2Mab-xYi  =  ai(DMab~1)Yi  =  ii2{Ma)Y{  =  \\Max\\2Yi. 

Multiplying  on  the  left  by  D  gives  the  expression 

M?D2Max  =  \\Max\\2b2x.  (8.23) 

Now  define  some  such  that 

/?,■  =  ad?  i  =  1, . . . ,  m  (8.24) 

with  a  selected  such  that  YZLi  Pi\[Pix\\\  ~  1’  Then  aD2  =  )Cf=i  fliPi  and  Equation 
8.23  becomes 

m  r  i 

£/?,•  [M'JVtM.x  -  \\M.x\\\Vix]  =  0 
1=1 

M»M.  x  +  [MfViM'X  -  \\Vix\\lM»  Max  -  \\M,x\%P,x]  =  0. 

1=1 


giving 
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A 

Equating  /?,-  =  A,-,  this  equation  becomes  identical  to  Equation  8.22  and  through  the 
relationship  of  Equation  8.24  the  proof  is  complete.  ■ 

A  A 

It  is  now  possible  to  directly  determine  a  D  given  Ud  such  that 

A  AA  ,  A  A 

p(MaUd )  =  <Ti{DMaD~l)  =  fi(Ma).  Again  using  Ud  from  Equation  8.11,  D  is  found 
to  be 

D  =  diag[ 1,  0.749,  1.106,  1.225,  1.218] 

with 

p(Ma)  =  p(MaUd)  =  atiDMaD-1)  =  12.81 

providing  the  same  results  as  before.  However,  no  additional  optimization  is  required 
for  this  method  as  opposed  to  Example  8.3  which  requires  an  infimization  over  D  of 
a(DMaUdD~ 1  +  rl).  Provided  that  the  2 (q  -  l)-dimensional  optimization  generates 
a  Unew  close  to  f/j,  the  transformation  from  Unew  to  Dnew  may  be  invoked  to  solve 
for  fi  using  the  method  of  Equation  8.20. 

8.8  Comparison  of  Optimization  Methods 

The  results  of  this  chapter  are  summarized  in  Tables  8.6  and  8.7.  Table  8.6 
lists  system  size,  the  results  of  inf# a(DMaD~x),  the  value  p(Ma),  the  results  of 
snPa,  P(MaUnew),  and  the  per  cent  difference  between  p(Ma)  and  supa.  p{MaUnew)- 
System  6  corresponds  to  that  of  Figure  8.3  and,  as  mentioned  earlier,  it  shows  the 
largest  deviation  between  and  sup^.  p{MaUnCw)  of  all  cusping  systems  studied 

to  date.  System  5  shows  the  largest  deviation  between  in{pa(DMaD~1)  and  p(Ma) 
while  the  corresponding  value  of  supa.  p(MaUnew)  equals  p(Ma )  within  four  significant 
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figures.  Clearly  the  result  of  the  2 (q  —  l)-dimensional  optimization  supa;  p(MaUnew) 
provides  an  excellent  estimate  of  p. 

Depending  on  the  particular  control  system  under  development,  it  is  likely  that 
these  estimates  of  p  will  be  satisfactory  without  any  additional  optimization.  In  the 
event  that  p  must  be  determined  exactly,  the  optimum  Unew  from  supa.  p(MaUnew ) 
may  be  used  as  an  starting  point  for  further  searches.  Using  the  same  systems  as 
Table  8.6,  Table  8.7  summarizes  the  flop  count  requirements  of  the  various  methods. 
Each  of  the  optimization  methods  call  a  series  of  MATLAB™  m-files  developed 
by  Grace  [33].  While  m-files  generally  execute  more  slowly  than  compiled  code,  the 
ability  to  track  flop  counts  provides  a  means  of  comparing  the  efficiency  of  algorithms 
regardless  of  the  host  machine. 

In  order  to  provide  as  equitable  a  comparison  between  methods  as  possible  each 
cusping  system  was  required  to  satisfy  three  criteria.  Of  all  cusping  systems  tested, 
only  those  meeting  these  criteria  are  included  in  Tables  8.6  and  8.7.  (Note:  It  should 
be  mentioned  that  failure  to  satisfy  these  criteria  does  not  mean  that  the  optimization 
methods  could  not  find  p  for  a  particular  system,  it  simply  means  that  any  flop  count 
comparison  would  be  erroneous.) 

The  first  selection  criterion  is  that  optimization  methods  A  and  B  (sup^  p(MaUd) 
and  supa.  p(MaUnew )  respectively),  must  begin  at  the  same  starting  point.  This  point 
is  a  value  of  Uj,  generated  by  the  method  of  Theorem  8.1.  For  q  =  2,  two  values  of 
Ud  are  generated  with  U\  corresponding  to  A'i  and  Y\  and  U2  corresponding  to  X 2 
and  Y2.  Since  these  singular  vector  pairs  are  effectively  interchangeable  at  the  cusp 
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System 

Size 

inf  vaiDMaD-1) 

V- 

supa.  p(MaUnew) 

%  Diff 
fi  vs  p(l 

1 

4 

6.8125 

6.7803 

6.7803 

-0.1 

2 

4 

5.9398 

5.9015 

5.9015 

-0.< 

3 

4 

7.1238 

6.9697 

6.9488 

-0. 

4 

4 

6.5750 

6.3673 

6.3671 

-0. 

5 

4 

1.0322 

0.9470 

0.9470 

-0. 

6 

5 

13.114 

12.810 

12.761 

-0.326 

7 

5 

31.710 

31.443 

31.440 

-0.009 

8 

5 

35.838 

35.615 

35.604 

-0.031 

9 

5 

43.913 

43.809 

43.807 

-0.007 

35.126 

34.930 

34.922 

-0.022 

7.7400 

7.8366 

9.7072 

29.473 

128.02 

74.749 

151.95 

209.64 

35.238 

35.918 


7.7394 

7.8346 

9.6193 

29.254 

127.07 

74.502 

150.29 

209.21 

34.610 

35.339 


7.7394 

7.8346 

9.6185 

29.253 

127.07 

74.501 

150.28 

209.20 

34.610 

35.336 


.0 
.0 
-0.008 
-0.003 
-0.002 
-0.001 
-0.002 
-0.001 
-0.001 
-0.008 
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Table  8.7:  Floating  Point  Operations  Comparison 


A 

Optimization  Method 

B  |  C  | 

D 

System 

sup^  p(MaUd ) 

SUPct;  p(MaUnew) 

B  i 

SUP  U,p{MaUd) 

B  1 

max,-  (Ti{DMaD~l ) 

1 

234,427 

128,578 

149,067 

149,899 

2 

138,177 

113,174 

144,883 

150,354 

3 

128,398 

136,015 

191,239 

157,372 

4 

188,109 

193,382 

212,596 

214,373 

5 

171,721 

63,162 

74,317 

79,7S7 

Average: 

172,166 

126,862 

154,420 

150,357 

6 

579,929 

301,294 

395,592 

343,173 

7 

382,910 

299,005 

550,583 

343,313 

8 

709,431 

285,912 

452,255 

329,634 

9 

839,772 

553,011 

722,400 

595,733 

10 

415,088 

256,610 

582,077 

300,868 

Average: 

585,426 

339,166 

540,581 

382, 5^4 

11 

677,513 

226,805 

408,046 

452,390 

12 

1,037,283 

384,400 

563,199 

611,346 

13 

1,992,286 

816,698 

1,221,836 

1,039,923 

1 4 

2,909,386 

1,172,394 

1,850,040 

1,398,542 

'  15 

2,850,485 

1,078,454 

2,345,768 

1,307,23S 

16 

3,203,488 

984,491 

1,952,090 

1,211,701 

17 

3,327,218 

1,240,607 

2,200,240 

1,465,977 

18 

2,786,795 

1,330,380 

2,342,328 

1,556,346 

19 

3,349,508 

946,319 

1,687,675 

1,113,345 

20 

4,492,976 

1,684,968 

3,521,077 

1,937,664 

Average: 

2,662,694 

986,551 

1,809,229 

1,109,472 
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(see  Equations  8.4  and  8.5),  both  U\  and  U<i  provide  candidate  starting  points  for 
the  subsequent  optimizations.  The  choice  between  the  two  is  made  by  selecting  the 
maximizer  of  p(MaU{)  for  i  =  1 The  extra  flops  required  to  select  the  desired 
Ui  are  generally  more  than  offset  by  increased  convergence  rates  resulting  from  the 
superior  starting  point. 

Choosing  Ui  or  U2  as  a  starting  point  for  sup^  p(MaUd)  corresponds  to  choosing 
whether  the  complex  multiplier  (<*i  -f  acts  on  X\  or  X2  in  Equation  8.7.  Using 
<*1  =  <*2  =  0  as  an  initial  condition,  matrix  U\  results  from 

Xnexu  =  A'l  +  (oil  +  ]<*'})  X2 

while  U2  results  from 

Xnew  —  (<*1  +  3&2)Xi  +  X2. 

This  procedure  allows  sup Udp{MaUd)  and  sup0.  p(MaUncvJ )  to  start  at  the  same  point 
satisfying  the  first  selection  criterion. 

The  second  criterion  requires  the  optimization  to  converge  to  p  from  the  common 
starting  point.  This  greatly  reduces  the  number  of  systems  available  for  flop  count 
comparisons  because  the  methods  are  all  known  to  be  nonconvex.  While  changing 
the  initial  starting  points  would  eventually  lead  to  p,  the  possibility  of  an  equitable 
flop  count  comparison  is  eliminated.  This  local  convexity  criterion  extends  to  the 
methods  of  columns  C  and  D  of  Table  8.7  as  well.  For  these  methods,  supa.  p(MaUnew) 
provides  an  improved  starting  point  for  subsequent  optimizations  by  sup^  p(MaUd) 
and  m&Xia^DMaD"1)  respectively.  As  discussed  in  Section  8.6,  the  optimization 
based  on  <ri(DMaD~l )  searches  for  all  stationary  points  including  maxima,  minima, 
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and  inflection  points.  While  the  computational  advantages  of  the  singular  value 
decomposition  makes  it  quite  fast,  its  nonconvexity  generally  prevents  its  use  without 
the  improved  starting  point  provided  by  supa.  p{MaUntw). 

The  final  selection  criterion  involves  termination  conditions  for  the  separate  meth¬ 
ods.  While  the  end  results  of  methods  A,  C,  and  D  should  be  identical,  differences  in 
problem  formulation  prevent  the  use  of  a  common  termination  condition.  If  one  algo¬ 
rithm  employs  a  more  strict  termination  condition,  it  may  unnecessarily  require  more 
flops  than  a  second,  less  strict  method  to  achieve  the  same  results.  To  resolve  this 
dilemma,  parameters  were  adjusted  to  halt  the  optimization  of  sup^  p(MaUd)  when 
llj^-ll  <  0.02.  Corresponding  parameters  for  max,  cr i(DMaD~l)  were  then  set  such 
that  termination  occurred  when  max,-  (Ji{DMaD~l )  =  p(MaUj)  within  four  significant 
figures. 

Termination  conditions  for  supa.  p(MaUnew )  were  adjusted  such  that  the  optimiza¬ 
tion  successfully  halts  when  all  terms  of  the  numerical  gradient  fall  below  10-2.  Tests 
with  varying  termination  conditions  suggest  that  further  reductions  in  the  numerical 
gradient  produce  little  if  any  improvement  in  p(MaUnew)- 

Having  discussed  the  three  selection  criteria,  the  flop  count  comparison  of  Table 
8.7  may  be  examined.  The  superiority  of  Method  B  in  terms  of  computing  efficiency 
is  immediately  apparent.  For  the  4  x  4  systems,  an  average  of  172, 166  flops  was 
required  by  Method  A  versus  126, 862  flops  for  Method  B:  a  savings  of  26%.  This 
savings  increases  to  42%  for  the  5x5  systems  and  63%  for  the  9x9  systems.  For 
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design  problems  requiring  exact  values  of  //,  Method  D  offers  a  savings  of  58%  over 
Method  A  and  39%  over  Method  C  for  9  x  9  systems. 

The  efficiency  of  Method  D  for  finding  the  exact  value  of  p  seems  to  be  a  direct 
result  of  the  excellent  starting  point  provided  by  first  solving  for  sup^.  p(MaUnew). 
In  fact,  for  many  of  the  20  systems  the  flop  count  difference  between  Methods  B 
and  D  simply  reflects  the  calculations  involved  in  computing  |j|  one  time.  The 
resulting  derivative  was  sufficiently  small  to  meet  the  termination  criteria  without 
further  optimization. 

It  is  important  to  emphasize  that  the  criteria  chosen  for  the  flop  count  com¬ 
parison,  especially  the  second  criterion,  actually  skew  the  results  against  the  new 
2(q-  l)-dimensional  search  method.  While  the  flop  count  savings  are  quite  substan¬ 
tial,  the  real  advantages  occur  for  systems  that  do  not  directly  converge  to  p  from 
the  starting  point.  Convergence  to  some  local  maxima  requires  the  selection  of  a  new 
starting  point  from  which  the  search  for  p  may  continue.  For  previous  methods  this 
means  a  search  over  n  —  1  dimensions  so  the  search  area  grows  exponentially  as  sys¬ 
tem  size  increases.  The  new  method,  however,  requires  a  search  over  only  a  2 (q  —  1) 
dimensional  space  greatly  increasing  the  likelihood  of  quickly  finding  p. 

8.9  Algorithm  for  Cusping  Systems 

Certainly  the  numerical  efficiency  of  supa.  p(MaUnew)  offers  advantages  even  if 
the  exact  value  of  p  must  be  found.  For  the  general  cusping  case  of  q  =  2,  the  ability 
to  generate  and  view  3-D  mesh  plots  as  in  Figure  8.4  offers  a  means  of  partially 
overcoming  the  problem  of  nonconvexity.  A  general  algorithm  for  computing  p  to 


119 


account  for  cusping  and  noncusping  systems  is  now  presented. 

1.  Compute  info  a(DMaD~l).  If  MPDA  is  established  then  a{DMab~'1)  =  p(Ma) 
and  the  algorithm  is  complete. 

2.  If  MPDA  is  not  established  then  no  stationary  point  was  found  and  the  sys¬ 
tem  is  either  a  “kiss”  or  a  “cusp.”  Determine  the  first  q  singular  vectors  of 
(DMaD~x)  where  q  is  the  multiplicity  of  a  and  compute  supa.  p(MaUnew).  If 
MPDA  is  now  established  (“kiss”)  or  p(MaUnew)  is  within  some  allowable  dis¬ 
tance  of  a(DMaD~x)  at  the  cusp  then  the  procedure  is  complete  and  upper 
bound  a(DMaD'~1)  is  used  for  p(Ma). 

3.  If  p(MaUnew )  fails  below  the  desirable  distance  from  a(DMaD~l)  then  continue 
with  the  PDA  search  over  cri{DMaD~l )  (Method  D  of  Table  8.7)  to  determine 
a  true  stationary  point. 

4.  If  the  stationary  point  from  Step  3  is  still  not  acceptable  then  either  change 
the  initial  settings  of  the  a,-  and  repeat  Steps  2  and  3  or  generate  a  mesh  of 
p(MaUnew)  over  the  a;  as  discussed  in  Section  8.4.  Then  investigate  all  local 
maxima  from  the  mesh  for  the  global  maximum  of  sup^  p(MaUd)- 

While  even  the  mesh  is  not  guaranteed  to  contain  p,  the  advantages  offered  by  the 
optimization  supa>  p(MaUnew )  are  a  tremendous  improvement  over  previous  methods 
of  computing  p  for  cusping  systems.  It  should  also  be  mentioned  that  this  approach 
for  computing  p  also  extends  to  cusping  block  structured  systems.  Equation  5.10 
reveals  the  relationships  between  the  elements  of  Xi,...,Xq  and  Yi,...,Yq  which 
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musi,  hold  even  when  cusping  occurs.  Similar  relationships  can  be  determined  for  any 
general  block  structured  system  so  forming  Xnew  as  linear  combinations  of  Xi, . . . ,  Xq 
proceeds  exactly  as  before.  The  only  difference  between  the  element-by-element  un¬ 
certainty  algorithm  and  that  for  general  block  structured  systems  is  in  the  formulation 
of  Uncw  which  must  reflect  the  associated  block  configuration.  Otherwise,  the  same 
reduction  in  optimization  variables  with  corresponding  efficiency  benefits  applies. 


CHAPTER  9 
CONCLUSION 


9.1  Summary 

This  dissertation  presents  new  results  in  the  area  of  stability  robustness  analy¬ 
sis  for  multivariable  systems.  Of  primary  significance  are  the  links  developed  between 
similarity  and  nonsimilarity  scaling  techniques  for  computing  the  structured  singular 
value  (/z).  The  results  of  Chapter  5  show  that  no  more  than  2 (n  —  1)  optimization 
variables  are  required  to  compute  //  using  either  similarity  or  nonsimilarity  scaling. 
While  the  nonsimilarity  scaling  approach  effectively  addresses  element-by-element 
structured  uncertainties,  the  ability  of  similarity  scaling  to  handle  general  block  struc¬ 
tured  uncertainties  with  no  more  than  2(n  —  I)  free  variables  greatly  simplifies  the 
calculation  of  /z  for  this  important  uncertainty  class. 

For  the  general  block  2x2  problem,  the  elimination  of  redundant  variables  leads 
to  a  guarantee  that  the  solution  of  inf Dba(DbMbD^1)  equals  /z(Mf,)  regardless  of 
whether  or  not  a  stationary  point  occurs  at  the  “inf.”  This  eliminates  the  need  to 
employ  one  of  the  computationally  intensive  lower  bound  optimization  routines  for 
this  class  in  the  event  that  the  maximum  singular  value  repeats. 

In  addition  to  the  reduction  in  optimization  variables,  the  link  between  the  two 
scaling  methods  is  shown  in  Chapter  6  to  include  a  direct  relationship  between  scaling 
matrix  D  of  similarity  scaling  and  scaling  matrices  R  and  L  of  nonsimilarity  scaling. 
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The  value  of  this  direct  relationship  is  shown  to  be  most  evident  when  applied  to 
methods  that  compute  a  lower  bound  for  /z  as  described  in  Chapter  7.  Since  such 
methods  involve  nonconvex  optimizations,  their  solutions  are  initially  only  candidates 
for  fi.  Guaranteeing  that  some  solution  actually  equals  /z  requires  showing  that  the 
candidate  and  the  upper  bound  of  7f(DMaD~l)  are  equal.  Computing  the  correspond¬ 
ing  D  for  a  particular  /z  candidate  is  straightforward;  however,  the  actual  singular 
value  decomposition  must  then  be  performed  on  D  and  Ma  matrices  with  dimensions 
as  large  as  n2  x  n2.  By  directly  transforming  D  to  the  corresponding  R  and  L ,  the 

A  A  A  A 

upper  bound  may  be  determined  using  the  n  x  n  decomposition  a(R~l  M  L~l)a(LP  R) 
with  a  substantial  reduction  in  floating  point  operations  for  each  step  in  the  required 
frequency  sweep. 

One  of  the  most  important  advantages  of  computing  the  singular  value  upper 
bound  for  /z  is  that  any  local  minimum  must  be  the  global  minimum.  As  long  as 
this  minimum  corresponds  to  a  stationary  point,  the  MPDA  theory  guarantees  that 
this  upper  bound  equals  /z.  However,  for  those  cases  that  do  not  achieve  stationarity 
at  the  minimum  (“cusps”),  /z  is  not  achieved  and,  in  fact,  the  actual  value  of  /z  may 
differ  from  the  upper  bound  by  more  than  15%.  Because  of  the  nonconvexity  involved 
in  lower  bound  calculations  and  the  dangers  of  choosing  an  optimistic  estimate  for 
/z,  the  conservative  upper  bound  is  generally  used  in  place  of  /z  when  cusping  occurs. 
This  is  particularly  true  as  system  size  increases  because  the  lower  bound  solution 
domain  increases  directly  with  n,  greatly  impeding  the  search  process. 

By  applying  the  MPDA  principal,  an  estimate  for  /z  requiring  only  2 (q  —  1)  op- 
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timization  variables  (where  q  denotes  the  multiplicity  of  the  cusp)  is  proposed  in 
Chapter  8.  The  largest  deviation  found  to  date  between  this  estimate  and  the  actual 
value  of  n  is  less  than  0.34%:  well  within  most  real-world  engineering  tolerances. 
Reductions  in  floating  point  operations  of  more  than  60%  for  9  x  9  systems  are  ex¬ 
perienced  versus  previous  methods  and  for  the  most  common  cusping  case  of  q  =  2, 
3— dimensional  mesh  plots  may  be  used  to  easily  locate  candidates  for  ft.  The  ft 
estimate  holds  for  cusping  systems  with  element-by-element  as  well  as  block  struc¬ 
tured  uncertainties  so  any  system  with  structured  uncertainties  can  benefit  from  this 
method.  Also,  the  estimate  can  serve  as  an  initial  st  -.rting  point  in  the  event  that 
ft  must  be  determined  exactly.  Such  a  procedure  employing  the  Principal  Direction 
Alignment  theorem  is  shown  in  Table  8.6  to  find  the  actual  value  of  fi  with  only  a 
slight  increase  in  floating  point  operations  over  that  required  for  the  estimate. 

9.2  Future  Directions 

9.2.1  Nonsimilarity  Scaling  with  Block  Structured  Uncertainties 

The  results  of  this  dissertation  serve  to  enhance  many  of  the  important  tools 
used  to  analyze  the  robust  stability  properties  of  multivariable  control  systems.  In 
the  process  of  completing  one  particular  objective,  other  promising  areas  frequently 
appear  as  directions  for  additional  research.  One  of  the  more  intriguing  of  such 
areas  emerging  from  this  work  is  the  possibility  of  adapting  the  nonsimilarity  scaling 
technique  to  work  with  general  block  structured  uncertainties. 

For  element- by-element  structured  uncertainties,  the  advantages  of  nonsimilarity 
scaling  are  quite  clear.  For  example,  using  the  similarity  scaling  approach,  System 
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12  of  Table  5.2  required  922,383  flops  to  compute  //  using  the  reduced  scaling  struc¬ 
ture.  For  the  same  system,  the  nonsimilarity  scaling  approach  required  only  105,061 
flops:  a  reduction  of  more  than  88%.  Unfortunately,  current  nonsimilarity  scaling 
formulations  do  not  treat  general  block  structured  uncertainties  so  the  tremendous 
advantages  cannot  be  realized  for  this  important  uncertainty  class. 

While  uncertainty  blocks  with  arbitrary  dimension  currently  prevent  the  use  of 
nonsimilarity  scaling  methods,  the  direct  relationship  between  D,  i?,  and  L  of  Chapter 
6  may  be  applied  to  at  least  one  specific  block  structure.  This  structure  requires 
only  that  all  the  blocks  be  of  a  common  size  so  that  more  than  the  standard  scalar 
uncertainty  elements  are  allowed.  To  illustrate  the  application  of  nonsimilarity  scaling 
to  this  class  of  uncertainties,  consider  the  system  of  Example  5.4  on  page  61.  A  direct 
calculation  of  R,  and  L  from  D  gives 

R=diag[ 1,  1,  0.833,  0.833],  L  -  diag[l,  1,  0.625,  0.625] 

with  a{R~xML~x)a(LPR)  =  v(DbMbDb1)  =  16.43  =  fi{Mb).  Since  the  direct  trans¬ 
formation  holds  here,  the  problem  could  have  been  solved  by  employing  nonsimilarity 
scaling  from  the  start.  The  ability  of  nonsimilarity  scaling  to  address  even  this  limited 
class  of  block  structured  uncertainties  combined  with  the  correspondence  between  op¬ 
timization  variable  requirements  suggests  that  some  general  block  uncertainty  form 
of  nonsimilarity  scaling  may  eventually  be  possible. 

9.2.2  H'M  and  //— Synthesis  Design  Methods 

Two  recent  advances  in  the  design  of  multivariable  control  systems  are  the 
H°°  and  //-synthesis  methods.  The  H°°  technique  allows  for  the  design  of  controllers 
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that  possess  optimal  performance  in  terms  of  minimizing  the  H 00  norm  of  some  fre¬ 
quency  dependent  transfer  matrix  [18].  On  the  other  hand,  ^-synthesis  extends 
the  optimal  performance  properties  of  the  H°°  method  to  encompass  robust  perfor¬ 
mance  characteristics  as  well  [19].  A  number  of  interesting  design  studies  involving 
the  n~ synthesis  approach  have  appeared  including  a  flexible  antenna  structure  and 
an  implementation  of  the  space  shuttle  lateral  axis  flight  control  system  [47,  48].  The 
next  few  paragraphs  give  a  brief  overview  of  these  techniques  along  with  possible 
applications  of  the  results  of  this  dissertation. 

For  transfer  matrix  G,  the  H°°  norm  of  G  may  be  written  as 

IIGMIU  =  sup?(G(S))  (9.1) 

where  s  again  denotes  frequency  dependence  [49].  Figure  9.1  depicts  the  standard 
H°°  system  model  with  feedback  controller  K.  Transfer  matrix  G  can  be  partitioned 
as 

G11  G12 
G  21  G22 

giving  a  set  of  system  equations  of  the  form 

z  =  G\iW  +  G12U,  y  =  G21W  +  C?22m> 

Noting  that  u  =  Ky  allows  this  to  be  rewritten  as 

z  =  [Cru  +  G\2^(I  ~  G22^)~lG2l]w  (9-A) 

Defining  F  =  [Gn  -f-  GnI((I  -  Gv2K)~lGi\\  as  the  transfer  matrix  from  external 
input,  10,  to  output  2,  Equation  9.3  can  then  be  written  as 

2  =  Fw. 
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The  design  objective  then  involves  minimizing  the  H°°  norm  of  F.  Depending  the  on 
the  performance  objective(s)  to  be  optimized,  the  choice  of  K  that  best  meets  the 
objective(s)  can  be  chosen  from  the  set  of  all  stabilizing  controllers.  Characteriza¬ 
tion  of  all  proper,  stabilizing  controllers  can  be  obtained  from  the  so-called  “Youla 
parameterization”  [50]  allowing  the  minimization  to  be  formulated  as 

=  A  11^11“-  (!W> 

This  problem  turns  out  to  be  convex  and  a  method  to  directly  solve  for  the  optimal 
controller  has  been  proposed  by  Doyle  et  al  [51]. 


External 
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Control 
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Outputs 
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Figure  9.1:  H°°  System  Representation. 


The  optimal  performance  offered  by  a  system  satisfying  Equation  9.4  depends 
on  the  adequacy  of  the  plant  model.  Uncertainty  in  this  model  may  lead  to  unpre¬ 
dictable  behavior  that  quickly  loses  optimal  and  even  acceptable  performance  levels. 
However,  by  combining  the  H°°  design  procedure  with  singular  value  scaling  tech¬ 
niques,  robust  performance  and  stability  objectives  may  be  simultaneously  met.  Such 
a  methodology  was  introduced  by  Doyle  under  the  name  “//—synthesis”  [19].  Rather 
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than  simply  minimizing  a(F(s))  as  a  function  of  frequency  and  stabilizing  controller, 
I( ,  //—synthesis  allows  for  the  incorporation  of  structured  plant  uncertainties  with 
the  design  objective  of  solving 

mminf||JDJPjD"1||00  (9.5) 

stable  A  D 

where  F  represents  the  appropriate  combination  of  F  and  A.  For  fixed  D,  Equation 
9.5  is  a  standard  H°°  problem.  Similarly  for  fixed  I(  the  problem  involves  finding 
fj,(DFD~l).  Separately,  each  problem  is  convex.  Unfortunately,  the  combined  prob¬ 
lem  loses  this  desirable  convexity  property  so  there  is  no  guarantee  of  ever  achieving 
the  global  minimum  in  terms  of  both  K  and  D. 

While  the  //-synthesis  approach  provides  controllers  with  robust  performance,  it 
requires  a  tremendous  amount  of  computing  power  to  find  even  a  local  minimum  of 
Equation  9.5.  With  the  loss  of  convexity,  computational  requirements  increase  even 
more.  However,  by  applying  the  results  of  this  dissertation,  it  may  be  possible  to 
improve  the  efficiency  of  the  //—synthesis  method. 

For  example,  the  reduction  of  optimization  variables  for  systems  with  both  scalar 
and  block  structured  uncertainties  should  directly  apply  to  that  step  of  the  procedure 
involved  in  infimizing  the  maximum  singular  value  over  D.  Also,  the  direct  relation¬ 
ship  between  similarity  scaling  matrix,  D ,  and  nonsimilarity  scaling  matrices  L  and  R 
might  provide  insight  into  a  formulation  of  //-synthesis  involving  only  nonsimilarity 
scaling  with  its  corresponding  reduced  system  size.  Finally,  the  procedure  outlined 
in  Chapter  8  for  estimating  //  with  a  repeated  maximum  singular  value  should  apply 
to  the  //—synthesis  problem  when  cusping  occurs  so  that  unnecessary  conservatism 
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can  be  reduced  without  the  huge  computing  cost  involved  with  previous  methods  of 
determining  the  lower  bound  for  \i. 

Although  the  actual  benefits  of  applying  the  results  of  this  dissertation  to  the 
/r— synthesis  design  approach  are  as  yet  unknown,  any  efficiency  advantage  over  pre¬ 
vious  methods  is  certainly  welcome.  Since  all  but  the  simplest  of  design  problems 
require  tradeoffs  between  performance  requirements,  an  iterative  process  is  normally 
employed  to  achieve  a  satisfactory  performance  compromise.  Efficiency  improvements 
as  low  as  ten  or  twenty  percent  could  therefore  translate  into  several  hours  worth  of 
CPU  savings  as  an  acceptable  performance  trade-off  is  reached. 


REFERENCES 


1.  L.  Ljung,  System  Identification,  Prentice-Hall,  Englewood  Cliffs,  NJ,  1987. 

2.  B.  Friedland,  Control  System  Design,  McGraw-Hill,  New  York,  1986. 

3  A.  Yue  and  I.  Postlethwaiie,  “Improvement  of  Helicopter  Handling  Qualities 
Using  H00— Optimisation,”  Proc.  IEE.  vol.  137,  Pt.  D,  no.  3,  May  1990,  pp. 
115-130. 

4.  F.  Neslinc  and  S.  Joshi,  “Aerospace  Control  Systems,”  IEEE  Control  Systems 
Magazine,  vol.  10,  no.  4,  June  1990,  pp.  3-4. 

5.  M.  Morari,  and  E.  Zafiriou,  Robust  Process  Control.  Prentice  Hall,  Englewood 
Cliffs,  NJ,  1989. 

6.  H.  ?*.  Black,  “Stabilized  Feed-Back  Amplifiers,”  Electrical  Engineering,  vol.  53, 
Jan.  1934,  pp.  114-120. 

7.  H.  S.  Black,  “Inventing  the  Negative  Feedback  Amplifier,”  IEEE  Spectrum, 
Dec.  1977,  pp.  55-60. 

8.  H.  Nyquist,  “Regeneration  Theory,”  Bell  Systems  Tech.  J.,  vol.  11,  1932,  pp. 
126-147. 

9.  H.  Bode,  Network  Analysis  and  Feedback  Amplifier  Design,  Van  Nostrand,  New 
York,  1945. 

10.  P.  Dorato,  “Robust  Control:  A  Historical  Review,”  Proc.  IEEE  25th  CPC.. 
1986,  pp.  346-349. 

11.  A.  MacFarlane,  and  I.  Postlethwaite,  ”The  Generalised  Nyquist  Stability  Crite¬ 
rion  and  Multivariable  Root- Loci”,  Int.  J.  Control,  vol.  25,  1977,  pp.  81-127. 

12.  J.  Doyle  and  G.  Stein,  “Multivariable  Feedback  Design:  Concept  for  a  Classi¬ 
cal/Modern  Synthesis,”  IEEE  Trans  AC.  vol.  AC-26, 1981,  pp.  4-16. 

13.  “Challenges  to  Control:  A  Collective  View,”  IEEE  Trans  AC.  vol.  AC-32, 1987, 
pp.  275-285. 

14.  M.  Safonov,  “Stability  Margins  of  Diagonally  Perturbed  Multivariable  Feedback 
Systems,”  IEE  Proc..  vol.  129,  Pt.  D,  no.  6,  Nov.  1982,  pp.  251-  256. 


129 


130 


15.  J.  Doyle,  “Analysis  of  Feedback  Systems  with  Structured  Uncertainties,” 

Proc.  IEE.  vol.  129,  Pt.  D,  no.  6,  Nov.  1982,  pp.  242-250. 

16.  B.  Kouvaritakis  and  H.  Latchman,  “Singular  Value  and  Eigenvalue  Techniques 
in  the  Analysis  of  Systems  with  Structured  Perturbations,”  Int.  J.  Control,  vol. 
41,  no.  6,  1985,  pp.  1381-1412. 

17.  B.  Kouvaritakis  and  H.  Latchman,  “Necessary  and  Sufficient  Stability  Crite¬ 
rion  for  Systems  with  Structured  Uncertainties:  The  Major  Principal  Direction 
Alignment  Principle,”  Int.  J.  Control,  vol.  42,  no.  3,  1985,  pp.  575-598. 

18.  G.  Zames,  “Feedback  and  Optimal  Sensitivity:  Model  Reference  Transforma¬ 
tions,  Multiplicative  Seminorms,  and  Approximate  Inverses,”  IEEE  Trans  AC. 
vol.  AC-26,  1981,  pp.  301-320. 

19.  J.  Doyle,  “Structured  Uncertainty  in  Control  System  Design,”  Proc.  24t/l  CPC.. 
Ft  Lauderdale,  FL,  Dec.  1985,  pp.  260-265. 

20.  Y.  Foo,  “Robustness  of  Multivariable  Feedback  Systems:  Analysis  and  Optimal 
Design,”  D.  Phil.  Thesis,  Oxford  University,  Oxford,  England,  1985. 

21.  J.  M.  Maciejowski,  Multivariable  Feedback  Design,  Addison- Wesley  Publishing 
Company,  Wokingham  England,  1989. 

22.  R.  Smith  and  J.  Doyle,  “Model  Invalidation:  A  Connection  between  Robust 
Control  and  Identification,”  Proc.  ACC.  1989,  pp.  1435-1440. 

23.  M.  Fan,  A.  Tits  and  J.  Doyle,  “Robustness  in  the  Presence  of  Joint  Parametric 
Uncertainty  and  Unmodeled  Dynamics,”  Proc.  ACC.  1988,  pp.  1195-1200. 

24.  R.  De  Gaston  and  M.  Safonov,  “Exact  Calculation  of  the  Multiloop  Stability 
Margin,”  IEEE  Trans  AC.  vol.  AC-33,  no.  2,  Feb.  1988,  pp.  156-171. 

25.  R.  Pena  and  A.  Sideris,  “Robustness  with  Real  Parametric  and  Structured 
Complex  Uncertainty,”  Proc.  IEEE  27tk  CPC..  1988,  pp.  532-537. 

26.  P.  Young  and  J.  Doyle,  “Computation  of  the  Structured  Singular  Value  with 
Real  and  Complex  Uncertainties,”  Proc.  IEEE  29th  CPC..  1990. 

27.  H.  Latchman,  ’’Frequency  Response  Methods  for  Uncertain  Multivariable  Sys¬ 
tems”,  _D,_PhiL_Thesis,  Oxford  University,  1986. 

28.  M  Safonov  and  J  Doyle,  “Optimal  Scaling  for  Multivariable  Stability  Margin 
Singular  Value  Computation,”  MECO/EES  ’83  Symposium,  Athens,  Greece, 
Aug.  29-Sep.  2,  1983,  pp.  466-469. 

29.  R.  Sezginer  and  M.  Overton,  “The  Largest  Singular  Value  of  exA0e~x  is  Convex 
on  Convex  Sets  of  Commuting  Matrices,”  IEEE  Trans  AC.  vol.  AC-vol.  35, 
no.  2,  Feb.  1990,  pp.  229-230. 


131 


30.  E.  Osborne,  “On  Pre-Conditioning  of  Matrices,”  J.  Assoc.  Comput.  Mach., 
1960,  vol.  7,  pp.  338-345. 

31.  R.  Daniel,  B.  Kouvaritakis,  and  H.  Latchman,  “Principal  Direction  Align¬ 
ment:  A  Geometric  Framework  for  the  Complete  Solution  to  the  //-Problem,” 
IEE  Proc..  vol.  133,  Pt.  D,  no.  2,  Mar.  1986,  pp.  45-56. 

32.  PC-MATLAB  User’s  Guide.  The  Math  Works  Inc.,  South  Natick,  MA,  1989. 

33.  A.  Grace,  “Computer-Aided  Control  System  Design  Using  Optimization  Tech¬ 
niques,”  Ph.D.  Thesis,  University  of  Wales,  Bangor,  England,  1989. 

34.  W.  Press,  Numerical  Recipes:  The  Art  of  Scientific  Computing,  Cambridge  Uni- 
versity  Press,  New  York,  NY,  1986. 

35.  J.  Doyle  and  C.  Chu,  “Robustness  Control  of  Multivariable  and  Large  Scale  Sys¬ 
tems,”  Final  Technical  Report  for  AFOSR,  Contract  F49620-84-C-0088,  Mar. 
1986. 

36.  N.  Tsing,  “Convexity  of  the  Largest  Singular  Value  of  eDMe~D:  A  Convexity 
Lemma,”  IEEE  Trans  AC.  vol.  AC-35,  no.  6,  June  1990,  pp.  748-749. 

37.  M.  K.  H.  Fan  and  A.  L.  Tits,  “A  New  Formula  for  the  Structured  Singular 
Value,”  Proc.  IEEE  24*A  CPC..  1985,  pp.  595-596. 

38.  M.  K.  II.  Fan  and  A.  L.  Tits,  “Characterization  and  Efficient  Computation 
of  the  Structured  Singular  Value,”  IEEE  Trans  AC.  vol.  AC-  31,  no.  8,  pp. 
734-743,  Aug.  1986. 

39.  “Harwell  Subroutine  Library,”  Library  Reference  Manual,  Harwell,  England, 
1982. 

40.  The  //-Analysis  and  Synthesis  Toolbox,  The  MathWorks  Inc.,  South  Natick, 
MA,  1990. 

41.  A.  Packard,  M.  Fan,  and  J.  Doyle,  “A  Power  Method  for  the  Structured  Singular 
Value,”  P:oc.  IEEE  27th  CPC.  1988,  pp.  2132-2137. 

42.  G.  Golub  and  C.  Van  Loan,  Matrix  Computations,  The  John  Hopkins  Univer¬ 
sity  Press,  Baltimore,  Maryland,  1983. 

43.  A.  Packard  and  J.  Doyle,  “Structured  Singular  Value  with  Repeated  Scalar 
Blocks,”  Proc.  ACC.  1988,  pp.  1213-1218. 

44.  Y.  Hung,  A.  MacFarlane,  Multivariable  Feedback:  A  Quasi-Classical  Approach, 
Springer- Verlag,  Berlin,  1982. 

45.  G.  Stewart,  “Error  and  Perturbation  Bounds  for  Subspaces  Associated  with 
Certain  Eigenvalue  Problems,”  SIAM  Review,  vol.  15,  Oct.,  1973,  pp.  727- 
764. 


132 


46.  P.  Young,  “Frequency  Domain  Analysis  of  Multivariable  Systems  with  Struc¬ 
tured  Uncertainties,”  Master’s  Thesis,  University  of  Florida,  1988. 

47.  B.  Balas,  C.  Chu,  J.  Doyle,  “Vibration  Damping  and  Robust  Control  of  the 
JPL/AFAL  Experiment  Using  /^—Synthesis,”  Proc.  IEEE  28*A  CPC..  1989,  pp. 
2689-2694. 

48.  J.  Doyle,  K.  Lenz,  and  A.  Packard  “Design  Examples  Using  /^-Synthesis:  Space 
Shuttle  Lateral  Axis  FCS  During  Reentry,”  Modelling,  Robustness  and  Sensiti¬ 
vity  Reduction  in  Control  Systems,  Springer- Verlag,  Berlin,  1987. 

49.  B.  Francis,  A  Course  in  Control  Theory,  Springer- Verlag,  Berlin,  1987. 

50.  D.  Youla,  H.  Jabr,  and  J.  Bongiorno,  “Modern  Weiner-Hopf  Design  of  Optimal 
Controllers,  Part  II:  The  Multivariable  Case,”  IEEE  Trans  AC.  vol.  AC-21, 
pp.  319-338,  1976. 

51.  J.  Doyle,  K.  Glover,  P.  Khargonekar,  and  B.  Francis,  “State-Space  Solutions  to 
Standard  II2  and  H-Infinity  Control  Problems,”  Proc.  ACC.  1988,  pp.  1691- 
1696. 


BIOGRAPHICAL  SKETCH 


Robert  Jeffrey  Norris  was  born  on  October  13,  1959,  in  Morebead  City,  North 
Carolina.  He  graduated  summa  cum  laude  from  North  Carolina  State  University  in 
1981  with  a  degree  in  electrical  engineering.  He  then  entered  graduate  school  at  North 
Carolina  State  University  and  received  a  master’s  degree  in  electrical  engineering  in 
1983.  After  completing  his  master’s  degree  he  received  a  commission  in  the  United 
States  Air  Force  where  his  work  involved  the  guidance  and  control  of  remotely  piloted 
vehicles.  In  1987  he  was  selected  for  an  Air  Force  fellowship  to  study  electrical 
engineering  at  the  University  of  Florida. 


133 


